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Abstract 

We give an algebraic characterization of the tree languages that are 
defined by logical formulas using certain Lindstrom quantifiers. An im- 
portant instance of our result concerns first-order definable tree languages. 
Our characterization relies on the usage of preclones, an algebraic struc- 
ture introduced by the authors in a previous paper, and of the block 
product operation on preclones. Our results generalize analogous results 
on finite word languages, but it must be noted that, as they stand, they 
do not yield an algorithm to decide whether a given regular tree language 
is first-order definable. 



Classification: ACM: F.4.3, F.4.1. MSG: 03B70, 68Q70, 68Q45 

One of Bret Tilson's lasting contributions is the introduction (with John 
Rhodes) of the notions of block product and two-sided semidirect product, and 
their use in the structure theory of finite monoids. This tool was initially intro- 
duced to derive iterated decompositions of morphisms and to refine the wreath 
product-based Krohn-Rhodes decomposition of finite monoids [25|. It quickly 
found applications in formal language theory (see [Ml 133 ISHl HI 133 among 
others). One of the more fruitful applications of this work has been in the 
investigation of the logical aspects of automata theory (on finite words). For 
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instance, the expressive power of first-order formulas with a certain quantifier 
depth, can be captured by monoids which divide an iterated block product of 
scmilattices of the same length, see Straubing's book [55] , 

Automata are particularly well suited to discuss the behavior of terminating 
sequential systems (languages of finite words), and this field of research has 
benefited from the start from the well-established connection between automata 
and monoid theory. There is however also much interest in automata-theoretic 
descriptions of languages of other structures than finite words, corresponding to 
other natural ideas of implementation and other natural models of computation 
(infinite, branching, concurrent, timed, etc). This paper is a contribution to 
the investigation of an important problem of this sort: can we decide whether a 
regular tree language is first-order definable? Here trees are finite, ranked and 
ordered. The latter properties signify that the nodes of the trees are labeled 
with symbols of a given arity (the rank of the node), and the children of a node 
of rank r form a totally ordered set of cardinality r. A tree language is said to be 
regular if it is accepted by a classical (deterministic) bottom-up tree automaton. 

The notion of automata recognizability for (finite) word languages is eas- 
ily translated to an algebraic notion of recognizability, expressed in terms of 
monoids: the set of all words on a given alphabet A is a monoid (the free 
monoid A* over that alphabet), and one shows that a language is recognized by 
a finite state automaton if and only if it is the inverse image of a set, under a 
morphism from A* into a finite monoid. Moreover, if a language is recognizable, 
then there is a least finite monoid recognizing it, called its syntactic monoid. 
This point of view opens vast possibilities for the classification and the discus- 
sion of the properties of recognizable languages, which can be characterized in 
terms of the algebraic properties of their syntactic monoid, see Eilenberg's va- 
riety theory [TTl [251 H] ■ As the syntactic monoid of the language accepted by a 
given automaton is computable, this can lead to interesting decision algorithms. 

It is well-known (Biichi, 1960), that a language is recognizable if and only if it 
is definable in monadic second-order logic. It was also shown that a language is 
definable by a first-order formula if and only if its syntactic monoid is aperiodic. 
This statement is actually the combination of two classical theorems due to 
Schiitzenberger and to McNaughton and Papert. It can also be proved directly 
(as in [35]), using the Krohn- Rhodes decomposition theorem, which implies that 
a monoid is aperiodic if and only if it belongs to the least pseudovariety closed 
under block product and containing ?7i = {1,0}. As the syntactic monoid of 
a regular language is computable and as aperiodicity is decidable, it is also 
decidable whether a regular language is FO-dcfinablc. 

Considering logically defined sets of trees (or other discrete combinatorial 
structures) is just as natural as for words. But the literature on these questions 
shows that classification and decision results are much harder to reach, in part 
because we lack the versatile and powerful algebraic tool provided in the word 
case by finite monoid theory. The weakness of our understanding of automata 
theory for tree languages is highlighted by the fact that the decidability of first- 
order definability is still an open problem. 

For most discrete structures, there is no obvious algebraic structure that can 
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be used in lieu of monoids, or at least no algebraic structure that gives rise to the 
same wealth of structure theorems and variety characterizations, see j39j . In the 
tree case (for finite, ranked and ordered trees), several propositions can be found 
in the literature, see the work of Steinby, Salehi, Heuter, Podelski, Wilke, Esik, 
etclSllMllSIllSllIllllBinilll]- Until recently none was very convincing in 
terms of its capacity to characterize significant classes of languages, but there 
are recent encouraging results expressed in terms of minimal tree automata, 
that is, in terms of S-algebras by Benedikt and Segoufin [3], Bojanczyk and 
Walukiewicz [7], Esik [B] and Esik and Ivan [HIIiQ. In a previous paper [ioj , 
the authors introduce a new algebraic framework - the so-called preclones ~ to 
classify and discuss the properties of recognizable tree languages. 

It turns out that the setting of preclones makes it natural to discuss not only 
the recognizable sets of trees, but also recognizable sets of trees with variables. 
Variables can be seen as unlabeled leaves of the tree, and the rank of a tree is 
the number of such unlabeled leaves. Alternately, one can regard these leaves 
as labeled by particular letters {vi, 7J2, . . .}, in such a way that in a rank k tree, 
the variable leaves are labeled vi, . . . ,Vk from left to right. 

We verified in [18] that the notion of recognizability induced by the alge- 
braic structure of preclones coincides with the usual notion of recognizable tree 
languages, that the syntactic preclone of a recognizable tree language is com- 
pletely determined by the minimal deterministic bottom-up automaton of the 
language (all very reassuring facts), and that these notions are robust enough 
to allow for an Eilenberg-like development in terms of varieties of languages and 
pseudovarieties of preclones. 

In this paper, we use this algebraic framework to derive an algebraic char- 
acterization of first-order definable tree languages, and more generally, of the 
classes of tree languages determined by certain families of Lindstrom quantifiers. 
This characterization requires the introduction of a block product operation on 
preclones, a complex algebraic operation which generalizes Tilson's block prod- 
uct of monoids. Our main result implies that the first-order definable tree 
languages are exactly those languages whose syntactic preclone sits in the least 
pseudovariety of preclones closed under block product and containing T3 , a very 
simple preclone whose properties were discussed in [TH| and which can be viewed 
as an analogue of the monoid Ui. This result was announced without proof in 
the authors' communication at FST-TCS [17]. 

As it is, our result does not yield an algorithm to decide whether a given rec- 
ognizable tree language is first-order definable. This question is briefly discussed 
in the conclusion of the paper, but whatever the case may be, such a decidability 
result remains one of the main goals in this field. Our result however suggests 
the feasibility of an algebra-based solution. 

The plan of this paper is as follows. Section [1] summarizes the essential 
properties of preclones that are necessary for this study. Section [2] introduces 

^ After our results were announced in 2003 |17) and while this paper was in preparation or 
under refereeing, an interesting approach of tree languages in terms of so-called forest algebras 
was introduced by Bojanczyk and Walukiewicz. See the conclusion of this paper for a brief 
discussion. 
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the logical apparatus we will use, including the Lindstrom quantifiers and the 
closure properties of the associated operators on classes of languages. Finally, in 
Section [31 we introduce the block product operation on preclones, and we prove 
our main results. The paper closes on a conclusion where we discuss certain 
questions raised by these results. 

Notation Let ?? > 0. We denote by [n] the set {1, ...,??} if n > 0, the empty 
set if n = 0. 

1 The algebraic framework 

Throughout the paper, we will be discussing sets of finite ranked trees, that 
is, trees in which the set of children of each inner node is linearly ordered; E 
designates a ranked alphabet, that is, S = (S„)„>o where the I]„ are pairwise 
disjoint sets and IJ^^ E„ is finite. An element of S„ is said to have rank n. 

This section summarizes the main facts relative to the algebraic framework, 
which we will use to establish our main theorem. Most of these results are taken 
from the authors' earlier paper |18] and are stated here without proof. 

1.1 Preclones 

A preclone is a many-sorted algebra S — ((>S'n)„>o, •, 1). The elements of S„ 
are said to have rank n, the element 1 belongs to 5*1, and the composition 
operation • associates with each / £ 5„ and n-tuple g = (gi,...,g„) (with 
Qi 6 Srui, 1 < i < n), an element f • g £ S*™ where m = Moreover, 1 

and • satisfy the axioms given below. 

For convenience, a tuple g as above is written <? = 51 © • ■ • ® Pn, we say that g 
has total rank m, written rsnk^g) = m, and we let Sn^m be the set of n-tuples of 
total rank m. Note that Si^m = Sm for all m. We also write n for the ©-sum of 
n copies of 1, so that n G Sn.n- The axioms defining preclones are the following: 

1 • / = / = / • n for each / £ Sn, n>0, 
and if ■ g)-h ^ f ■ {gi - hi ® ■ ■ ■ ® g^ ■ K) 

where f £ Sn, g ~ ©iLi 5* with each gi £ Srm , h = ®^jLi with each hj £ Sp. , 
m — mi, hi equal to the ©-sum of the mi first /ij's, /12 equal to the ©-sum 
of the 7712 next /ij's, etc. 

It is interesting to remark that is naturally equipped with a monoid 
structure. 

Sub-preclones of preclones are defined in the natural way. A morphism of 
preclones, ip: S ^ T , is a rank preserving map, which also preserves the unit 
element 1 and the composition operation. Similarly, a congruence is an equiv- 
alence relation, that relates only elements of equal rank, and which is stable 
under the composition operation. The quotient of a preclone by a congruence 
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is naturally endowed with a preclone structure, and the projection map is an 
onto morphism. 

The least sub-prcclone of a preclone S, containing a given subset A is called 
the sub-preclone of S generated by A. If this preclone is S itself, we say that 
S is generated by A. A preclone is finitely generated if it admits a finite set of 
generators. 

A preclone S is said to be finitary if each Sn is finite. Observe that as soon 
as some Sk, k > 2 is non-empty, then infinitely many Sk are non-empty, and 
hence S is not finite. 

1.2 Examples of preclones 

The following examples will be essential for our study. 

Trees and free preclones Let S be a ranked alphabet. The free preclone 
generated by S, written EM, can be described as follows (see [HI Section 
2.2]). Let {vk)k>i be a sequence of variable names: we let SM„ be the set of 
finite trees, whose inner nodes are labeled by elements of S (where a rank k 
letter labels a node with k children), whose leaves are labeled by elements of 
Eq U {vi, . . . , Vn}, and whose frontier (the left to right sequence of leaf labels) 
contains exactly one occurrence of wi, . . . , w„, in that order (that is, belongs 
to EqIIiEq • • • u„Eq). The elements of EAf„ are called trees of rank n or n-ary 
trees over E. 

If t is such a tree, we let NV(i) be the set of nodes of t with a label in E (NV 
stands for non-variable labeled). In the logical discussion to follow (Section [2|), 
we will give the nodes in NV(<) a particular role. 



Figure 1: / G EA/„, .g = .gi © • • • © .g„ G EA/„^j„ and two views of f ■ g 

EAf is a preclone for the following operations: if / S EM„ and gi e EAfm; 
(1 < i < n), then the composite tree f ■ (gi (B ■ ■ ■ (B g„) is obtained by replacing 
the Ui-labeled leaf of / by the root of gi, and by renumbering the variable labeled 
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leaves of the resulting tree with consecutively indexed f^ 's, see Figure [TJ The 
unit element 1 is the graph in EMi consisting of a single node labeled vi . 

Each element of E can be identified with an element of EM: the letter 
a S E„ is identified with the tree with n + 1 nodes, consisting of the root, 
labeled cr, and the n children of the root, labeled vi, . . . ,v„ in this order. 

The elements of rank 0, EMq, are the ordinary E-labeled trees. 

Example 1.1 As discussed in the introduction, our results can be seen as gen- 
eralizations to trees of known results on recognizable word languages. This 
meta-statement can be made precise in the following fashion: if A is a finite 
(unranked) alphabet, we can view A as a ranked alphabet, all of whose ele- 
ments have rank 1. Then the elements of AMi can be seen as the words of the 
form wvi, where w £ A* . In particular, the monoid AMi is isomorphic to, and 
will be identified with the free monoid A* . The sets AM„ (n 7^ 1) are empty. □ 

Preclone of transformations, preclone of an automaton If Q is a set, 
let Tn{Q) be the set of n-ary transformations of Q, that is, the set of mappings 
— > Q. Let also T{Q) = {Tn{Q))n>o- Composition of mappings endows T{Q) 
with a preclone structure in the following sense: if / G T„((5), gi G T,„;((3) 
(1 < i < n) and m = J^i ™i> then / ■ (51 © ■ • ■ © gn) maps (gi, . . . , q,n) to 

/(.9l(9l7 • ■ • : 9Tni),g2((7mi+l, • ■ • , grni+ma), ■ • ■ ,gn{qm-m„+l, ■ ■ ■ , Qm)) ■ 

If E is a ranked alphabet and Q is a E-algebra, each element a G E„ de- 
termines naturally an n-ary transformation of Q. The sub-preclone of T{Q) 
generated by E is called the preclone associated with Q. 

Example 1.2 Let A be an unranked alphabet, viewed as a ranked alphabet as 
in Examplc ll.il An A-algebra Q is simply a set, equipped with an action of A, 
that is, a deterministic complete automaton. Each letter a G A then defines a 
mapping Q Q. Thus the preclone associated with Q has elements of rank 1 
only, which form the usual transition monoid of the automaton, see [TTJ [52] . □ 

We note that E-algcbras are natural objects in our context: a deterministic 
complete bottom- up tree automaton accepting trees in EAfp (see [S]), with state 
set Q, can be described as a finite E-algebra Q, equipped with a set F C Q of 
final states. 

The preclones Tg and Tp Let B be the Boolean semiring B = {true, false}, 
and let Tg be the subset of T(B) whose rank n elements are the n-ary or function 
and the n-ary constant true, written respectively or„ and true„ (by convention, 
org is the nuUary constant falseo). Then Tg is a preclone, which is generated by 
the binary or2 function and the nuUary constants trueo and falsep. The set B 
equipped with these 3 generators can be viewed as a finite tree automaton, and 
Tg is the preclone associated with this automaton. 
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It is interesting to note that the rank 1 elements of Tg form a 2-element 
monoid, isomorphic to the multiphcative monoid {0, 1}, and known as Ui in the 
hterature on monoid theory, e.g. [22] • 

Similarly, if p > 2 is an integer and Bp = {0, 1, . . . ,p—l}, let Tp be the subset 
of T(Bp) whose rank n elements {n > 0) are the mappings fn,r'- (j^ii ■ ■ ■ 5 ^n) ' — ^ 
ri + ■ ■ ■ + ?'„ + r mod p for < r < p. Again. Tp is a finitely generated preclone, 
generated by the nuUary constant 0, the unary increment function /i.i and the 
binary sum f2fi. Moreover, Tp can be seen as the preclone associated with a 
p-element automaton, and its rank 1 elements form a monoid isomorphic to the 
cyclic group of order p. 

Preclone-generator pairs If S* is a preclone and A is a set of generators of 
S, we say that (S, A) is a preclone-generator pair, or pg-pair. A pg-pair (S*, A) is 
said to be finitary if S is finitary and A is finite. The notions of sub-pg-pair and 
morphisms of pg-pairs are defined naturally: (S*, A) is a sub-pg-pair of {T,B) 
is A C_ B (so that S is by construction a sub-preclone of T); and a morphism 
of pg -pairs tp: {S,A) — > {T,B) is a preclone morphism from S" to T such that 
ip[A) C B. 

1.3 Syntactic preclones 

Let 5" be a preclone and let L C Sk- We say that L is recognizable if there exists 
a morphism ip: S —>■ T into a finitary preclone and a subset P Q T^ such that 
L ~ ip^^{P). Then we say that L is recognized by T, and by the morphism ip. 
If (S, A) and (T, B) are pf/-pairs and is a morphism between these pf/-pairs, 
we say that L is recognized by (T, B). 

Let f €z Sn- A context of f in L is a pair (w, u) where u is an element of S 
and 1; is an n-tuple of elements of 5, such that / can be inserted under u and 
above v to produce an element of L, see Figure [2l We would like this condition 
to read u ■ f ■ v E L, but it has to be a little more technical, to specify where 
precisely / is attached under u. 

Formally, for each fc > 0, n > 0, let /fc.„ be the set of n-ary contexts in 
Sk, that is, the set of tuples of the form (u, fci, w, ^2), where ki,k2 > and 
fci + fc2 < k, u e Ski+i+k-i and v € Sn,i with £ = k — {ki + /C2), see Figure [2l If 
L C Sk, f £ Sn, we say that a context {u, ki, v, fc2) G Ik,n is an L-context of f 
if u • (ki © / • w © k2) G i. We also let the set of 0-ary contexts in Sk be the 
set Ikfi of tuples {u, ki, 0, fc2) where fci, fc2 > and u e S'fcj+i+fej (the symbol 
is introduced here to preserve the uniformity of notation). We say that such a 
context is an L-context of / G 5*0 if u • (ki © / © k2) = u ■ (ki © / ■ © k2) S L. 

Next we say that elements of /, g are L-equivalent, written / ~i g f and g 
have the same L-contexts. The relation is a congruence, called the syntactic 
congruence of L, the quotient preclone S/ is called the syntactic preclone 
of L, and the projection morphism is the syntactic morphism. Finally, if A is 
a set of generators of S, the syntactic pg-pair of L is the pair (T, B) where 
T ~ S/ and B is the image of A in the syntactic morphism. We note the 
following result, proved in [iHl Proposition 3.2]. 
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/C - (fcl + fc2) 



Figure 2: an n-ary context in SAf^; is it an L-context of / (/ S E„)? 

Proposition 1.3 Let S be a preclone, k > and L C_ Sk. Then the following 
statements hold. 

• A morphism of preclones recognizes L if and only if it it can be factored 
through the syntactic morphism of L. 

• If T is a sub-preclone or a quotient of T' , and if T recognizes L, then so 
does T' . 

• L is recognizable if and only if its syntactic preclone is finitary. 

• The analogous statements hold for pg -pairs. 

Wc will be primarily concerned with the case where 5 is a finitely gener- 
ated free preclone, S = SM with E a ranked alphabet. The subsets of each 
EMfc are called tree languages. In that case, the notion of recognizable tree lan- 
guages defined above coincides with the classical notion of recognizability, and 
in particular, the syntactic preclone of a recognizable tree language L C EMq 
coincides with the preclone of the minimal automaton of L, see [IHl Section 3.2]. 
It is interesting to note that the syntactic E-algebra of L [M] is exactly the rank 
part of the syntactic preclone of L, and that the syntactic tree monoid [25] 
of L is the monoid of rank 1 elements of its syntactic preclone. In particular, if 
L C EMp, then the syntactic preclone of L is finitary if and only if its rank 
part is finite. 

We now consider two important examples. In each, the alphabet is a ranked 
Boolean alphabet A, that is, a ranked alphabet such that whenever A„ ^ 0, 
then A„ = {1„,0„}, see [18l Section 3.3] for more details. 

Example 1.4 For fc > 0, let /^a:(3) be the set of trees in AAf^ containing 
at least one node labeled 1„ (for some n). Then Kk{3) is recognizable and 
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its syntactic preclone is defined in Section [T^ see [TH Section 3.3]. More 
generally, let ip: SA/ — > Tg be a morphism, with S an arbitrary ranked alphabet. 
Let 

= \J{a e I]„ I ip{a) = or„} 

n 

and let E*^^) be the complement of E^"' in E. Then the subsets of EA/fc recognized 
by (p are 0, EM^, (^"^(orfc) = E("U'ffe and (^"^(truet), the set of trees in EA4- 
with at least one occurrence of a letter in E*^^'. 

Similarly, if p,r are integers with < r < p and if A'fe(3p consists of the 
trees in AMk such that the number of nodes labeled 1„ (for some n) is congruent 
to r modulo p, then A'fe(Elp is recognizable and its syntactic preclone is Tp. If 
if: EM — > Tp is a morphism, then the subsets of EAf^ recognized by ip are the 
finite unions of the if~^{fn,r) (0 < r < p). For each such r, let 

E('-) ^ []{a G E„ I p{a) = 

n 

For each t £ EAf^, let be the number of nodes in NV(t) labeled by a letter 

in E^''), and let w{t) = I]rrwr(t). Then <y9"^(/„,r) is the set of all t G EMfc 
such that w{t) — r (mod p). □ 



1.4 Varieties of tree languages and pseudovarieties 

A pseudovariety of preclones is a class of finitary precloncs which is closed 
under taking finite direct products, sub-preclones, quotients, finitary unions of 
w-chains and finitary inverse limits of w-diagrams, see [HI Section 4]. Here, we 
say that a union T = 1J„>q T*^"-* is finitary if T is finitary. Finitary inverse 
limits of an oj-diagram of the form (^("): r("+i) T'"' are defined similarly. 

The definition of a pseudovariety of pg -pairs is similar: it is a class of fini- 
tary pg-^aiva which is closed under taking finite direct products, sub-preclones, 
quotients, and finitary inverse limits of w-diagrams (there is no need to consider 
unions of a;-diagrams, see [IHl Section 4.4]). 

We note the following proposition [TSl Corollary 4.22]. 

Proposition 1.5 Let \ he a pseudovariety of preclones and let S be a finitary 
preclone such that, for all s =/= t Cz S , there exists a morphism ip from S into a 
preclone in V with ip{s) ^ ^{t)- Then 5 G V. 

An analogous statement for pseudovarieties of pg-pairs also holds. 

Proposition 1.6 LefV be a pseudovariety of pg -pairs and let {S,A) be a fini- 
tary pg-pair such that, for all s ^ t £ S, there exists a morphism tp from {S, A) 
into a pg-pair in V with ip{s) ^ 'p{t)- Then [S, A) G V. 

Proof. This statement is proved in the same fashion as Proposition 11.51 (see 
[H]), using also [H Proposition 4.23]. □ 
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If K is a class of finitary preclones (resp. pg-pairs), there exists a least 
pseudovariety containing K, which is said to be generated by K and is denoted 
by (K), see [HJ Section 4.2]. We record in particular the following results, which 
follow from [T51 Propositions 3.3 and 4.16, Corollary 4.8]. 

Proposition 1.7 Let K 6e a class of finitary preclones (resp. pg-pairs) and let 

V he the pseudovariety generated by K. The syntactic preclone (resp. pg-pair) 
of a recognizable tree language belongs to V if and only if it is a quotient of a 
sub-preclone (resp. sub-pg-pair) of a direct product of elements o/K. 

Proposition 1.8 A pseudovariety of preclones (resp. pg-pairs) is entirely de- 
termined by the syntactic preclones (resp. pg-pairs) it contains. 

Note that pseudovarieties of preclones can be seen as particular examples 
of pseudovarieties of pg-pairs, in the sense of Proposition 11.91 belo^ivFI. If K is a 
class of p5-pairs, we let precl(K) be the class of preclones S such that {S, A) G K 
for some set A. Conversely, if L is a class of preclones, we let pgp(L) be the 
class of finitary pg-pairs {S, A) such that 5* S L. Let us say that a class K of 
pg-pairs is full if membership of a pg-pair (5, A) in K depends only on S; that 
is, K = pgp(precl(K)). 

Proposition 1.9 A pseudovariety V of pg-pairs is full if and only if there exists 
a pseudovariety W of preclones such that V = pgp(W) and in that case, W is 
the pseudovariety generated by precl(V). 

Moreover, ifK. is a full class of pg-pairs and V is the pseudovariety gener- 
ated by K, then V is full as well and precl(K) and precl(V) generate the same 
pseudovariety of preclones. 

Proof. Let W be a pseudovariety of preclones and let V = pgp(W). The class 

V is full by definition. Let us first verify that it is closed under taking sub-pg- 
pairs, quotients, finite direct products and finitary inverse limits of w-diagrams. 
Suppose for instance that {S, A) is a sub-pg-pair of (T, B) with T G W. Then 
S* is a sub-preclone of T, so S' £ W and {S, A) G V. The verification is equally 
routine for quotients and finite direct products. As for inverse limits of lo- 
diagrams, it was shown [H Proposition 4.23] that if {S,A) = lim„(5("), A^")), 
then S = lim„ S'("). Thus, if the (S'("), A^")) are in V, then the S'^") are in W 
and hence S* G W and {S, A) G V. 

Now let us show that if L is a class of finitely generated finitary preclones, 
then pgp((L)) C (pgp(L)). Let {U,C) be a finitary pg-pair with U G (L): we 
want to show that {U,C) G (pgp(L)). Combining technical results from [18] 
(namely Propositions 4.5 and 4.16), we may assume that there exist preclones 
S*'^',. . . , S*^"-* in L such that U = ^p{T') for some morphism ip-.T' ^ U where 
T' is a sub-preclone of Hi ^'--^^ . Since L consists of finitely generated preclones, 
let a''' be a finite set of generators of 5^*-'. Let B be a finite subset of T' such 
that ip{B) = C and for each i, let i?^*) be the projection of B onto the i-th 

■^The notion of pseudovarieties of pg-pairs could be generalised to the notion of varieties 
of stamps as is done for word languages, see 1361 124| . 
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component. Finally, let T be the sub-preclone of T' generated by B. Then 
{U,C) = (p{T,B) and (T, B) is a sub-^-pair of ll^{S^'\ A^''^ U B^*)). This 
establishes that {U,C) G (pgp(L)). 

Let now K be a full class of finitary pg-pairs and let W = (precl(K)). 
We verify that (K) — pgp(W), which implies that (K) is full. Indeed, since 
K is full, we have K = pgp(precl(K)) and hence (K) = (pgp(precl(K))) C 
(pgp(W)). The first part of the proof establishes that pgp(W) is a pscudovariety, 
so (K) C pgp(W). Moreover, the discussion in the previous paragraph, applied 
to L = precl(K), shows that pgp(W) C (K). The expected equality follows. 

Applying this result to K = V, a full pscudovariety of pg-pairs, and to W = 
(precl(V)), shows that V = pgp(W), as announced. Finally, if V = pgp(W') 
for some other pscudovariety of preclones W', then W and W' have the same 
finitely generated elements, and hence must be equal by Proposition 11.81 This 
concludes the proof of the proposition. □ 

Before we discuss varieties of tree languages, let us define quotients of tree 
languages. Let L C SM^., let ki,k2 be integers with ki + k2 < k and let 
u G EMfcj^_|.i_|.fe2 . The left quotient of L by (u, ki, k2) is the subset oiYiMf~_j.^_f.^ 

(u, ki,k2)-^L = {/ e m'Ik-k^-k2 I M • (ki © / © ks) G L}. 

If w G YiMn^ki then the right quotient of L by z; is 

L«-i-{/GSM„|/-i;GL}. 

Remark 1.10 With the above notation, {u, ki,v, k2) is an L-context of an 
element / if and only if / G (^{u,ki,k2)^^L'jv^^ = {u, ki, k2)^^ [L^ki © w © 

k2)-i). 

Moreover, if (u, fci, w, /c2) and (u', fci, f ', /s2) are contexts such that u u' 
and V v', then ((u, fci, A:2)^^L)u^^ = ((u', ki, k2)~^L^v'~ . □ 

We say that a morphism ip: T.M Y,' M is a literal morphism if ip(T.) C 
E'. A variety of tree languages (resp. a literal variety of tree languages) is a 
collection V = (Vi;.fc)i;.fe, where S runs over all ranked alphabets, k runs over 
non-negative integers, such that each Vs,fc is a Boolean algebra of recognizable 
languages in TiMk , and V is closed under quotients and under inverse morphisms 
(resp. inverse literal morphisms) between free preclones. In particular, every 
variety of tree languages is a literal variety. 

If V is a pscudovariety of preclones (resp. pg-pairs), we let var(V) = (Vs^fc) 
be such that Vs.fe is the class of languages in SM^. with syntactic preclone 
(resp. pg-pair) in V. If V is a variety (resp. literal variety) of tree languages, 
let psv(V) be the class of finitary preclones (resp. finitary pg-pairs) which only 
accept languages in V. The following result was proved in |18] . 

Theorem 1.11 The mappings var and psv are mutually inverse lattice isomor- 
phisms between the lattice of pseudovarieties of preclones (resp. pg-pairs) and 
the lattice of varieties (resp. literal varieties) of tree languages. 
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We note the following corollary of Theorem ll.lli which will be used in the 
sequel. 

Corollary 1.12 Let V he a literal variety and let V he the corresponding pseu- 
dovariety of pg -pairs. Then V is full if and only ifV is a variety. 

Example 1.13 Let (Tg) be the pseudovariety of preclones generated by Tg, and 
let V be the corresponding tree language variety. Then a language L C SAf^ 
is in Vs,A: if and only if i is a Boolean combination of languages of the form 
E'Mfc, E' C E, see Example O and [H Section 5.2.1]. □ 

More complex examples are discussed in [18l Section 5.2], and the main 
results of this article provide further examples. 

2 Logically defined tree languages 

Let E be a ranked alphabet. We will define tree languages by means of logical 
formulas. We consider the atomic formulas of the following form 

Pcr{x), x<y, SucCi{x,y), root(a;), maxij{x), leftj(a;) and nghtj{x), 

where ct £ E, i,j are positive integers, i is less than or equal to the maximal 
rank of a letter in E, and x,y are first-order variables. If fc > 0, subsets of 
EMfe will be defined by formulas of rank k, composed using atomic formulas 
with j e [fc], the Boolean constants false and true, the Boolean connectives 
and a family of generalized quantifiers called Lindstrom quantifiers, defined in 
Section [2.11 below. As usual, each quantifier binds a first-order variable (within 
the scope of the quantifier), and variables that are not bound are called free. 
A formula without free variables is called a sentence. We denote by Lind the 
logic defined in this fashion. 

When a Lind-formula is interpreted on a tree t £ EAffe, first-order variables 
are interpreted as nodes in NV(i) — and we assume t 7^ 1, so that NV(f) is non- 
empty. Then Pa{x) holds if x is labeled a (cr G E), x < y holds if y is a proper 
descendant of x, and SucCi(x, y) holds if y is the i-th successor of x. Moreover, 
root(x) holds if x is the root of t and maxj.j {x) holds if the i-ih successor of x 
is labeled by Vj, the j-th variable. Finally, \eftj{x) (resp. nghtj{x)) holds if the 
index of the highest (resp. least) numbered variable labeling a leaf to the left 
(resp. right) of the frontier of the subtree rooted at x is j, see Figure [3l The 
interpretation of Lindstrom quantifiers is described in Section 12.11 

Recall that formally, an interpretation is a mapping A from the set of free 
variables of a formula ip (or from a set containing the free variables of ip) to the 
set NV(t) of E-labeled nodes of a tree t. If t satisfies (p with this interpretation, 
we say that (t, A) satisfies ip and we write (t, A) \= ip. If (/3 is a sentence, we 
simply write t \= ip. 
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Figure 3: \ehh{x) A nghtj{x) 

Remark 2.1 In Lind- formulas, first-order variables are never interpreted as 
one of the Wi-labeled leaves. In fact, as far as logical constructs go, these partic- 
ular leaves are not considered as proper nodes of the tree, but rather as place 
markers - which explains the fact that they are labeled by their position in the 
left-to-right order, and may be relabeled appropriately when trees are composed. 

When we deal with traditional trees, that is, trees in SAfo, this peculiarity 
disappears, and we observe that in that case, our atomic formulas {Pa, <, root 
and the SucCi) are the atomic formulas of the usal logic on rooted ranked trees 
[37]. □ 

Example 2.2 Let A be an unranked alphabet, viewed as a ranked alphabet as 
in Example 11.11 Then AMi is equal to the set A*vi, and is isomorphic to the 
free monoid A*. In this situation, the boundary of the trees in AM consist of 
a single node, labeled vi, that is lefti(a;) and right]^(a;) always evaluate to false. 
Thus the relevant atomic formulas are Pa{x) {a & A), x < y, Succi(a;, y), root(a;) 
and maxi^i(x). Note that in this case, root(a;) is the predicate usually denoted 
by min(a;) (or x = min) and maxi_i(x) is the predicate max(a;) (or x = max). 
That is, we have the same atomic formulas as in Biichi's classical sequential 
calculus [23l [37l [35]. The condition t 1 imposed to interpret formulas, is 
equivalent to the fact that logical formulas arc not interpreted on the empty 
word. □ 

Next to the atomic formulas defined above, we also use the following short- 
hand notation. Let fc > and let lefto(x) be the formula of rank k lefto(x) = 
AjG[fc] ~'l6ftj(a;). Then lefto(a;) holds if no leaf situated to the left of the frontier 
of the subtree rooted at x, is labeled by a variable. We observe that for different 
values of k, we get different formulas lefto(a;), and our notation assumes that k 
is clear from the context. 

Similarly, if k is clear from the context, we define rightj,_|_]^(a;) to be the 
formula of rank k rightfc_|_i(a;) = Aje[fc] ~'i'ightj(a;). Its meaning is that no leaf 
situated to the right of the frontier of the subtree rooted at x, is labeled by a 
variable. 
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2.1 Lindstrom quantifiers 

Before we give formal definitions, we discuss an important example. 

Example 2.3 Let us consider the first order formula 3x ■ ip{x), where (p is a 
formula with free variables in a set F U {x} (x ^ Y). Let X:Y ^ NV(t). Recall 
that {t, A) \= 3x-(p a there exists a node v in NV(<:) such that {t, [X;x^v]) \= ip. 
For convenience, let A„ denote the interpretation [X;x t— » v]. We can express 
the satisfaction of 3a; • iy9 in the following, more generalizable fashion: we label 
each node v £ NV(<) with 1 if (i. At,) \= ip, with otherwise (the variable labeled 
nodes arc left unchanged). If t\ denotes the resulting Boolean-labeled tree, then 
{t, A) \^ 3x ■ ip a and only if tx belongs to the set of trees with at least one 1 
label. 

To be formally accurate, the nodes of tx must be labeled by a ranked alpha- 
bet, that is, we need to have, for each rank n, a letter 1„ and a letter 0„. The 
definition of Lindstrom quantifiers below generalizes this example. □ 

Let A be a ranked alphabet containing letters of rank n for each n such that 
E„ 7^ and let {ps)s£A be a family of rank k formulas on E-trees. We say that 
this family is deterministic with respect to a first-order variable x if for each tree 
t € SMfe, for each integer n, and for each interpretation A of the free variables 
in the ips mapping x to a rank n node of t, then (i, A) satisfies exactly one of 
the (ps, S G An- Whenever needed, we will also assume that x is not bound in 
any of the ipg. 

Example 2.4 If A = E, a very simple example of such a family is given by 
letting ps{x) = Ps{x) for each S £ A. □ 

Example 2.5 Another natural example is given over a ranked Boolean alpha- 
bet A, that is, an alphabet such that whenever A„ ^ 0, then A„ ~ {!„, On}. If 
for each such n, (/3o„ is logically equivalent to ~'Pi^ , then {ps)seA is deterministic 
with respect to any first order variable x. 

In later examples, when dealing with ranked Boolean alphabets, we will write 
ipn instead of pi^ and we will assume that (/5o„ = ~^Pn- Then a deterministic 
family will simply be written ((^„)„. □ 

With this notion, we define (simple) Lindstrom quantifiers, a definition adap- 
ted from [5D1[TU] to the case of finite trees. Let K C AMk be a language of rank 
k trees and let {(ps)s£A be a family of rank k formulas which is deterministic with 
respect to x. Then the quantified formula Qkx ■ {p5)seAi where the quantifier 
Qk binds the variable x, is interpreted in the following manner. 

Given a tree t S EMj. and an interpretation A of the free variables in the 
(ps except for x, we construct a tree t\ € AMk as follows: t and t\ have the 
same underlying tree structure with the same variable-labeled nodes, that is, the 
same set of nodes and the same relations <, Sucq, root, maxij, \eftj and right^). 
Moreover, for each rank n node v of t (for some n), let A„ be the interpretation 
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[A, X v]: then the node v in t\ is labeled by the unique element 5 G A„ such 
that (t, A„) satisfies tps- The tree i\ is called the characteristic tree determined 
by t, A and the formulas Lp^- If the ips have no free variable other than we 
write i for i\. Finally, we say that (t, A) satisfies Qkx ■ {(ps)seA if ^^a G K. 

Remark 2.6 With the above notation, we note that {t, A^) \= fs{x) if and only 
if (t\, [x ^ v\) Ps{x). Since x is the only free variable in Ps{x), this is also 
equivalent to (^a, A.„) [= Ps{x). □ 

Example 2.7 Suppose that A = S and (ps = Ps{x) as in Example 12.41 If 
t £ SMfe, it is easily verified that the trees t and t are equal. In particular, if 
K C AMfc is a language of fc-ary trees, then t satisfies Qkx ■ {(ps)seA if and 
only at € K. □ 

Example 2.8 Let A be an unranked alphabet, seen as a ranked alphabet as 
usual, suppose that A^ = for all fc ^ 1 and let K C AMi. Then K can also 
be seen as a word language since AMi is isomorphic to the free monoid A|, 
and the logic Lind is analogous to the logic for word languages studied by Esik 
and Larsen in [16] (the latter does not include min and max among its atomic 
formulas). □ 

In the next examples, A is a ranked Boolean alphabet such that A„ is non- 
empty whenever E„ is, and {^n)n is a family of formulas which is deterministic 
with respect to a first order variable x, see Example [ 



Example 2.9 Let K = Kk{3) denote the set of all trees in AMk containing at 
least one node labeled 1„ (for some n), see Example 11.41 Then the Lindstrom 
quantifier Qk is a generalization of the existential quantifier, as indicated in 
Example El 

More precisely, (i. A) satisfies Qkx ■ {ipn)n if and only if there exists a node 
V G NV(i) such that (t, \^) satisfies (pm where n is the rank of v and A„ is the 
interpretation [A,x ^-^ v\. 

Let finally A be an unranked alphabet, viewed as a ranked alphabet as in 
Example 11.11 and suppose that fc = 1. Then (t. A) \= Qkx • (^Pn)n (where t is 
viewed as a tree in AMi) if and only if {t, A) ^ 3x ■ (pi{x) (where t is viewed as 
a non-empty word in A*). □ 



Example 2.10 In the same manner as in Example 12. 9i if p > 1, r < p and 

K = K{3p) denotes the set of those trees in AMk such that the number of 
nodes labeled 1„ (for some n) is congruent to r modulo p (see Example II .41) . 
then the Lindstrom quantifier Qk is a generalization of a modular quantifier. 

More precisely, {t, A) satisfies Qkx ■ {'Pn)n if and only if, for some n, the 
number of nodes v S NV(t) such that (<, A^,) satisfies ^Pnix) (where n is the rank 
of v) is congruent to r mod p. 

If A is an unranked alphabet, then {t, A) \= Qkx ■ {(pn)n (where t is viewed 
as a tree in AMi) if and only if (t, A) |= 3pX ■ ipi{x) (where t is viewed as a 
non-empty word in A*). □ 



15 



Example 2.11 Let K = iffc(3path) be the set of all trees in AM^ such that 
all the nodes along at least one path from the root to a leaf are labeled 1„ (for 
appropriate values of n) . 

Then (t, A) satisfies Qkx ■ {fn)n if and only if there exists a root-to-leaf path 
such that, for every node v S NV(t) along this path, (t, At,) ^ (pn{x) (where n 
is the rank of v). 

If A is an unrankcd alphabet, then {t, A) |= Qkx ■ {(pn)n (where t is viewed 
as a tree in AMi) if and only if {t,X) ^ V.t ■ (pi{x) (where t is viewed as a 
non-empty word in A*). □ 



Example 2.12 Let Kk^^next) be the set of all trees in AAl^ such that the 
children of the root are labeled 1„ (for the appropriate n). 

Then [t, A) satisfies Qkx ■ (Vn}n if and only if, for every child v of the root, 
(t, Xy) \= ifnix) (where n is the rank of v). 

If A is an unranked alphabet, then (t. A) |= Qkx- {ipn)n (where t is viewed as 
a tree in AMi) if and only if {t, A) ^ 3x ■ (Succ(l, x) A(pi{x)), — more formally, 
(t. A) \= 3x ■ ((Vj/ • (min(?/) Succ(j/, .t))) A (pi{x)) — (where t is viewed as a 
non-empty word in A*). 

Other next modalities can be expressed likewise, e.g., requesting that at 
least one (resp. an even number, etc.) of the children of the root satisfies the 
appropriate ipn. □ 



2.2 The language associated with a Lind-formula 

Let ip he a. Lind-sentencc of rank k over E. We denote by L^p the set of trees 
in TiMk that satisfy ip, and we say that L^p is defined by the formula ip. 

For a class K. of tree languages, we let Lind(/C) denote the fragment of Lind 
consisting of the formulas in which all Lindstrom quantifiers are of the form Qk 
with K G /C. If is a Lind(/C) sentence, we say that L^p is 'Lind{IC)- definable, 
and we let Cind{lC) denote the class of Lind(/C)-definable tree languages. 

Example 2.13 Let K.^ be the class of all the languages of the form Kk{3) 
on a Boolean ranked alphabet. In view of the discussion in Example I2.9[ it 
is reasonable to say that Lind{K,^) is exactly the class of FO-definable tree 
languages. Examples l2.9l and l2.10l show that if /Cg.mod is the class of all languages 
of the form Kk{3) or Kk{3'p), then Cmd{IC^^aiod) is the class of (FO + MOD)- 
definable tree languages. □ 



It will be useful to associate a tree language also with the Lind-formulas that 
contain free variables (as is done in [351 Section II-2] for word languages). Let Z 
be a finite set. We extend S to the ranked alphabet S^, whose set of letters of 
rank m (m > 0) is Sm x T-'{Z). We identify each cr £ E with the pair (cr, 0) g S^. 
An element z € Z is said to occur in t G SzAf at node v if the label of v is of 
the form {a, Z') and z e Z' . If each z G Z occurs exactly once in t G E^M^, 
then t is called a Z-structure of rank k over E. We note that a Z-structure 
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uniquely determines a tree t G T,M and a mapping X: Z ^ NV(i). Conversely, 
any such pair {t, A) determines a unique Z-structure, written str(t, A). Now let 
(fi he a, rank k Lind-formula with free variables in a set Y. Let str(t, A) be a 
Z-structure with Z CY. If i^-.Y \ Z — > NV{t), we write (str(<, A), /i) (p if 
{t, [A; /i]) \= (f, where [A; /i] is the map from Y to NV(i) determined by A and /i. 
If Z = Y, wc write simply str(t, X) \= (p and wc say that str(t, A) satisfies (p. We 
let be the set of y-structurcs satisfying (p. 

Example 2.14 Let a E S„j, and let ip be the rank k formula ip ~ Pa{x). Let 
y be a set containing x. Then L^p is the collection of all y-structures of rank k 
over S such that some (necessarily unique) node has a label of the form {cr,Y') 
with X £ Y' . It is immediate to observe that any two trees of equal rank in "EM 
have the same contexts in Lip, that is. the restriction of the syntactic congruence 
of Lp to SM is the universal relation. The same holds for any atomic formula 
ip. □ 



2.3 Properties of the operator Cind 

We now explore the properties of the operator Cind on families of languages. 
2.3.1 Cind is a closure operator 

Theorem 2.15 Cind is a closure operator on classes of languages. That is, for 
all language classes /C and IC' , the following holds. 

(1) ICC Cind{JC); 

(2) if ICQ K.' then Cind{lC) C Cind{lC'); 

(3) Cind{Cind{IC)) C Cind{lC). 

Item (1) follows immediately from Example 12.71 and Item (2) is immediate 
from the definition. The rest of Section 12.3.11 is devoted to the proof of Item 
(3). 

Let phe a Lind(£zn(i(/C))-formula of rank k over E. We argue by induction 
on the structure of p to show that there is an equivalent formula p of Lind(/C), 
that is, a formula with the same free variables as (p and such that L,^ = 
in EyMfc for any finite set Y containing the free variables of ip. This will be 
sufficient to prove Theorem 12.151 

If ip is an atomic formula, we let ip = tp, since p) is also a Lind(/C)-formula. 
If ip = ipi W P2 (resp. ip = -^pi), we let (p = (piM (p2 (resp. p = ~^Pi)- The 
equivalence of p and p is easily verified. 

The last case occurs when p is of the form p = Qkx ■ {ps)s£A, where 
K € Cind{1C) and the ps form a family of rank k formulas of Lind(£md(/C)) 
over S that is deterministic with respect to x. In particular, K = where ijj 
is a rank k Lind(/C)-sentence over A. 
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By induction, for each S G A, there exists a Lind(/C)-formula ips equivalent 
to (fs, so that ip is equivalent to Qkx • {p>s)seA- Thus, we may assume that the 
(fis are Lind(/C)-formulas. 

Before we proceed with the end of the proof, we establish a technical fact. If 
X is a formula and p, q are variables, we denote by xl'Z/p] the formula obtained 
from X by substituting the variable q for all free occurrences of p. (Bound 
occurrences of g in x are renamed as usual.) 

Let X be a rank k formula over A. We then define x to be the rank k formula 
over E obtained from x by replacing each subformula of the form Ps{z), where 
z is any first-order variable, by the formula (ps[z/x]. Since the quantifiers in x 
also occur in x, and the quantifiers in x occur either in x or in the (ps, it is clear 
that X is a Lind(/C)-formula if and only if x is one. In the sequel, we assume 
that neither x nor any free variable of one of the ips, is free in x, and that no 
free variable has bound occurrences in the formulas under consideration. Let 
us then assume that Y (the finite set containing the free variables of (p and not 
containing x) also contains the free variables of x- 

Fact 2.16 With the notation above, let t G SMfe, let X:Y ^ NV(t) be a func- 
tion, and let i\ G AAffc be the characteristic tree determined by t, A and the 
formulas ips- Then we have 

(t,A)hx ^ (iA,A)hx- 

Proof. We argue by induction on the structure of x- Suppose first that x = 
Ps{z). Then x = 'Psiz/x]. Let fi be the restriction of A to F \ {z} and let be 
the characteristic tree determined by t, ji and the (ps[z/x\. A node v is labeled 
5 in tx if and only if {t, Xv) = (t, [A; x i— > u]) \^ (ps- Since z docs not occur in ips, 
this is equivalent to (i, z i—^ v]) \= ips[z/x], and hence to the labeling of v by 
S in if^. Thus t\ = t^. Then we have: 

(i, A) \^ X ~ P5[z/x] <;==^ (t, [/i; z i-^ -^(-z)]) |= ^s[z/x] by definition of /i 
^ {t^, [z ^ A(;^)]) = (iA, [z ^ A(z)]) ^ Ps{z) 
^ {txA)^Ps{z) = x- 

If X is another atomic formula (namely, zi < Z2, SucCi(zi, Z2), root(a;), maxi.j(z), 
\ehj{z) or rightj(z) with z, zi,Z2 S YU{x}), then x X- Since t and i\ have the 
same variable-labeled nodes and they differ only in the labeling of their nodes, 
and since x does not depend on that labeling, we have in each case 

(i,A)hx ^ {tx,\)hx- 

We have now established our claim for atomic formulas. 

The induction step is immediate if x is of the form X ~ Xi V X2 or x = ~'Xi- 
Wc now assume that x = Qlz ■ (Xw)i^esi where L C flMk, z ^ Y U {x}, and 
{X(jj)(jj&^ is a family of rank k formulas over A with free variables in y U {z}, 
which is deterministic with respect to z. 
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By construction x = Qlz • {Xui)uieci, and by induction hypothesis, for each 
node w G NV(i) and for each e fi, we have 

{t,[X;z^w])^Xoj {tx,[^'-,z ^ w]) \= Xuj- 

It follows in particular that (xi^)i^go is deterministic with respect to z. More- 
over, the characteristic tree determined by t, A and {xuj)ui£n is the same as that 
determined by A and {Xuj)ujen- Thus we have 

(t,A) hx ^ (iA,A) hx, 

which concludes the induction and the proof. □ 

We now return to the proof of Theorem 12. 151 Recall that (p = Qrx ■ {'Ps)s£A 
and K = C AM^ for some rank k Lind(/C)-scntence x over A (without free 
variables). We want to construct a formula in Lind(/C) that is equivalent to ip 
and we claim that x is such a formula. 

Indeed, let t G EM^, let A be a mapping X:Y ^ NV(i), and let tx G AMj, be 
the characteristic tree determined by t, A and the ips- By definition. (<, X) \^ ip 
if and only if tx € K, that is. ix ^ or equivalently, (tA,A) ^ ip. It was 
established in Fact 12.161 that this is equivalent to {t,X) |= t/j, which concludes 
the proof. □ 

2.3.2 Closure properties of Cind{JC) 

The objective of this section is to prove the closure properties summarized in 
Theorem l^m below. 

Theorem 2.17 Cind{lC) is closed under Boolean operations and inverse literal 
morphisms. Moreover, Cind{JC) is closed under left (resp. right) quotients if 
and only if any left (resp. right) quotient of a language in IC belongs to Cind{IC). 

We now prove Theorem l2.171 by considering separately each closure property. 

Boolean operations The fact that Cind{IC) is closed under the Boolean op- 
erations follows directly from the fact that Lind(/C)-formulas are closed under 
disjunction and negation. □ 



Inverse literal morphisms Let /i: E' E be a rank-preserving mapping, 
and let us also denote by h the induced morphism /i:(E'A/, E') (EA/, E). 
Note that if i is a tree, then h{t) differs from t only in the labeling of the nodes 
in NV(t). Let ip he a. rank k Lind(/C)-formula over E with free variables in a 
finite set Y. We show by structural induction on ip that there exists a rank 
k Lind(AI^)-formula p' over E', with the same free variables as ip, and such 
that (i. A) ^ ip' if and only if {h{t),X) \= p for any tree t G E'M^ and any 
interpretation X:Y ^ NV(t). 
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If If = Pc{x) for some cr G S, we let tp' = \J Pa'{x), where the disjunction 
runs over the letters a' S E' such that h{a') = a. If (p is another type of atomic 
formula, then ip docs not depend on the labeling of the tree, and it suffices to 
choose (f' = If. 

The inductive step for the Boolean connectives is equally natural: if 1^9 = 
ifi V (p2 (rcsp. (p = ""/^i), then we let ip' — ip'i V ip'2 (resp. ip' = ~'(Pi)- 

Suppose finally that ip is of the form Qkx ■ {'P5)seA- By induction, there 
exist formulas ip'g over E' such that, for each S, {t, Xy) \= ip'g if and only if 
{h{t), Xy) \= ips for any tree t £ S'M^, node v in t and mapping X:Y ^ NV(t). 
It follows that the characteristic tree determined by <, A and {(p'§)s<£A, and the 
characteristic tree determined by h{t), X and {(p5)seA coincide. As a result, we 
have {t, A) ^ (p' if and only if {h{t),X) ^ ip- □ 



Left quotients We now assume that any left quotient of a language in IC 
belongs to Cind{JC). Let ki,k2,ihe non-negative integers and let k = ki +£+k2- 
Let also iphe a rank k Lind(/C)-formula over E with free variables in a finite set 
Y, and let U = str(u, /i) be a Z-structure of rank fci + 1 + fc2 for some Z CY. 
(Without loss of generality, we may assume that u ^ 1.) Let X = Y \ Z. We 
prove by structural induction on tp that there exists a rank i Lind(/C)-formula 
ip' over S, with free variables in X and such that, for every tree t € SM^ and 
every mapping X: X ^ NV(i) (see Figure S]), we have 

{t,X)^ip' ^ (C/-(ki©<©k2),A) h'P- 
If is a formula without free variables {X = Y = Z = (l),U = u G 

EMfej-i-i+fej), this shows that L^i = {u,ki,k2)^^L^, and hence that Cind{lC) is 

closed under left quotients. 

We now proceed with the proof. We first observe that NV(t) may be viewed 

as a subset of NV(J7 • (ki © i © k2)): more precisely, the latter set is equal to 

the disjoint union of NV(t) and NV(t/) = NV(m). 




a Z-structure 



variables in X arc interpreted here 



Figure 4: 5' = J7 • (ki © t © ka) 

If ip is equal to Pa{x), we let ip' ^ ip if x ^ Z, and ip' ~ true (rcsp. false) if 
X G Z and x occurs at a node of U for which the first component of the label is 
(resp. is not) a. That is, if x G Z and U satisfies (resp. does not satisfy) ip. 
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Let now = leftj(x) (resp. rightj(x), maxij{x)) with 1 < j < k. li x ^ Z, 
we let = leftj_fej (x) if fci < j < ki (resp. right^.^.^ {x) if ki < j < ki+£+l, 
maxi.j_fcj (a;) if ki < j < ki + £), and i^' = false otherwise. li x G Z, then it 
does not depend on t and A whether ([/ • (ki © t © k2), A) ^ 1^9 or not. and we 
let (fi' = true or false accordingly. 

If (y9 = root(a;) and x ^ Z, we let (ys' = false. If x <E Z, we let ip' = true or 
false depending on whether U \= ip. 

Now consider the case where ip = (x < y). If x,y ^ Z, we let p' ~ ip. If 
X ^ Z and y G Z, we let = false. If x e Z but y ^ Z, we let 1^9' = true 
or false depending whether the node of U where x occurs in an ancestor of the 
{ki + l)-st variable leaf. Finally, if x,y Q Z, we let ip' = true or false depending 
whether U \^ ip. 

The last case of an atomic formula occurs if ip is of the form SucCi{x,y). If 
x,y ^ Z, we let p' = p. If x,y G Z, we let p' = true or false, depending on 
whether U \= ip. If x ^ Z and y G Z, we let ip' = false. Finally, if x G Z and 
y ^ Z, let w be the node of U where x occurs. If the i-th successor of w is the 
(fci + l)-st variable leaf, we let p' = root(?/); otherwise we let ip' = false. 

As usual, if p — pi \/ p2 (resp. p = ~'Pi), then we let p' = p'l V p'2 (resp. 
ip' = -,(^;). 

We now consider the case where p = Qkx ■ {p5)s£A- We may assume that 
X y . Let S* = f7 • (ki © < © ka), let s = u • (ki © t © kz) and let X: X ^ NV(t). 
Let sx be the characteristic tree determined by s, [A; /i] and {p5)seA- Then, for 
any v G NV(s) and for any S, we have 

{sx,[x ^ v]) Ps{x) -^=^ {s,[X;fj,;xi-^v])\=ps 

{S, [X;x v]) h PS. 

Moreover, 

(5,[A;M])h^ ^ (^,A)h^ ^ SAGA-. 
For each 6 G A, let p'g be the formula associated with ips and J7 by the 
induction hypothesis. Let t\ be the characteristic tree determined by t, X and 
{f's)^&^- Then, for any node v G NV(t), we have 

{S,[X;x ^ v]) \^ PS {t,[X;x t-^ v]) \^ p's 

and hence s\ is of the form s\ = u - (ki © © k2) for some tree u which differs 
from u only in the labeling of the nodes in NV(u). 

For each v G NV(u), we let J/*-""^ be the (ZU {x})-structure obtained from U 
by adding x to the second component of the label of v. Then, for each (5 G A, we 
let tps^y be the formula associated with ips and J/'-'"-' by the induction hypothesis. 
Then we have 

{t,X)^iPs,v ^ ([/("'■ (ki ©t©k2). A) h 

{S, [X;x ^ v]) 1= PS 
-^=^ {sx,[x i-^ v]) \= Ps{x). 
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Now, for each mapping a: NV(m) A, let Ua be the tree obtained from u 
by relabehng each node v G NV(u) with ol(v). Let also be the conjunction 
of the tpa{v),v when v runs over NV(w). Then 

(t,A)hV'a ^ SA = Ua ■ (ki ©iA ©ka). 

Finally, let 

</5" = V (V-a A (9(.a„,fci,fc2)-i/<('^5)5), 
a 

where the disjunction runs over all mappings a: NV(it) A. Then the above 
discussion establishes that {t, A) satisfies ip" if and only if {U ■ (ki © t © ka). A) 
satisfies (p. Moreover, since each (uq,, fci, k2)~^K is in Cind{JC), the formula (ys" 
is a Lind(£md(/C))-formula, and by Theorem I2.15i ip" is equivalent to some 
Lind(/C)-formula ip' , which concludes this proof. □ 



Right quotients The proof concerning the closure under right quotients is 
similar. We assume that every right quotient of a language in JC belongs to 
Cind{IC). Let fc > and let (p he a. rank k Lind(/C)-formula over S with free 
variables in a finite set Y. Let n > 1 and Z C Y, and let [/ = J7i © • • • © J7„ G 
E^Mnjt where each Ui is a Z^-structure of rank ki, Ui = str(ui, ^i), k ~ ki 
and the Zi form a partition of Z. Let u = (BtUi, ^ = [^i, . . . , and X = Y\Z. 

We show by structural induction on (p that there exists a rank n Lind(/C)- 
formula ip' with free variables in X such that, for every tree t G SA/„ and every 
mapping X: X NV(t) (see Figure[5]), we have 

(t,A) h^' <=^ it-U,X)^ip. 




variables in X are interpreted here 



a ©-sum of Zi-structures 



Figure 5: S^t-U 

We first consider the case of atomic formulas, li ip = P„{x), we let tp' ~ ip \i 
X ^ Z and ip' = true or false if x G Z, depending on whether U satisfies p. 

If (/? = root(a;), we let ip' = ip \i x ^ Z , and false if x G Z. 

If = maxij(x) and x ^ Z , we let ip' — ip \i j — ki + ■ ■ ■ + fc/i-i + 1 
for some h such that Uh = ^, and lys' = false otherwise, li x E Z and fci + 
• • • + kfi^i < j < ki + ■ ■ ■ + kji, wc let p' = true or false depending on whether 
Uh h max,j_(fe^+...+fc^_^)(x). 
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Suppose now that ip = {x < y). If x,y ^ Z,we let ip' = </?. If x, y G Z, we let 
(f' = true or false depending on whether one of the Uj satisfies cp. li x & Z and 
y ^ Z, we let ip' ~ false. Finally, ii x ^ Z and y G Z, let 1 < j < n be such that 
y £ (i.e. y occurs in Uj). Then we let ip' = Vi<j lefti(x) A Vj<h '''g'^t?i(^)- 

The situation is similar if (p = SucCi{x,y). If x,y ^ Z, we let (p' = ip. If 
x,y E Z, we let (p' = true or false depending on whether one of the Uj satisfies 
(p. If X E Z and y ^ Z, wc let Lp' = false. Finally, if x ^ Z and y G Z, let j be 
such that y £ Zj. If y does not occur at the root of Uj, we let Lp' ~ false. If y 
does occur at the root of Uj, we let ip' = maxi j{x). 

Finally, suppose that (p = leftj(a;) (resp. rightj(a:)). If x £ Z, let i be 
such that X £ Zi. Then we let pi' = true or false according to whether 
fci + • • • + fci_i < j < /ei + • • • + ki and satisfies leftj_(fcj+...+/j;_j) (a;) (resp. 
right^_(fe^+...+fc,_^)(x)). If X Z, we let p' = leftft(x) if j = Ei<ft>'i (resp. 
j = 1 + X]i</i ^i) f'^r some ft,, and pJ = false if j is not of that form. 

\f p ~ p}\\l P2 (resp. p = ^P\), then we let (/j' = pJ^ V (/32 (resp. p' — ~'p'i), 
and we now assume that (p = Qkx ■ {ps)s^ii, with x ^Y. Let S = t ■ U , let 
s — t ■ u and let A:X — > NV(t). Let sa be the characteristic tree determined 
by s, [A;/i] and {ps)seA- For each S E A, let v'i be the formula associated 
with ips and U by the induction hypothesis, and let tx be the characteristic tree 
determined by t, A and {p}'g)s<EA- Then the tree s\ is of the form s\ =tx ■ ii for 
some tree ii which differs from u only in the labeling of the nodes in NV(m). 

We continue as in the left quotient case. For each v E NV(u), we let U^"""^ he 
the structure obtained from U by adding x to the second component of the label 
of V and for each 5 G A, we let ips^v be the formula associated with (pg and 
by the induction hypothesis. As above, we verify that if Ua is the relabeling of 
u determined by the mapping a: NV(u) — > A, and if ipa is the conjunction of 
the 4'a{v),v (over the nodes v E NV{u)), then 

(i, A)|=Va sx = tx-Ua- 

We then let 

a 

where the disjunction runs over all mappings a: NV(u) — > A, and we note that 
(i. A) satisfies p" if and only if {t ■ U,X) satisfies p. Since each Ku~^ is in 
Cind{IC), the formula p" is a Lind(£md(/C))-formula, and hence is equivalent 
to a Lind(/C)-formula ip' , which concludes the proof. □ 

2.4 Logics admitting relativization 

Wc say that a fragment L of Lind admits relativization if Properties Rl and 
R2 below hold. 

Property Rl For all integers /ci, fc2 > and k > ki + k2, for each L-sentence 
(p of rank ki + 1 + k2 over an alphabet E and for each first-order variable x 
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without occurrence in 95, there exists an L-formula (^[^ x] of rank k in the free 
variable x with the following property. For each tree t € T,Mk and for each node 
V £ NV(t). {t,x t-* v) satisfies Lp[:^ x] if and only if 

• if s is the subtree of t with root v, then t is of the form t = r ■ (ki © s © 
(see Figure [6]), and 



Figure 6: The factorizations of t in Properties Rl and R2 respectively 

Property R2 For all ranked alphabet S, integer i > 1 less than or equal to 
the maximal rank of a letter in E and integers ki,k2,£ > 0, for each rank i 
L-sentence ip and for each first-order variable x without occurrence in ip, there 
exists an L-formula ip[> xi] of rank ki + £ + k2 over S in the free variable x 
with the following property. For each tree t £ Y,Mki+£+k2 and for each node 
V € NV(t), {t,x t— > v) satisfies (p[> xi] if and only if 

• the rank of v is greater than or equal to i, and its i-th child, w, has rank £ 

• if s is the subtree of t with root w, then t is of the form t = r- (ki ©s©k2), 
and s 1= (y3 (sec Figure [S]). 

Proposition 2.18 Let K, be a class oj tree languages containing /Cg and closed 
under the following operations. Let fci,fc27^ ^ 0, let A 6e a ranked alphabet 
and let E be a disjoint ranked alphabet such that card{En) ~ 1 if An 7^ 0, and 
card{En) = otherwise: if K C AMk^+i+k-i belongs to JC, then K ■ (ki ©E'Mf © 
ka) e Cind{IC); if K C AM/, belongs to K, then EMki+i+k2 • (kj © if © ka) G 
£ind{IC). 

Then Lind(A^) admits relativization. 

Proof. We first consider Property Rl. Let ki,k2 > 0, let fc > fci + fc2 and let 
<p be a Liind{IC) formula over S, of rank ki + 1 + k2, without any occurrence of 
X. We show by structural induction on Lp that there exists a rank k Lind(A^)- 
formula ip[^ x] where x is a free variable and such that, for any tree t £ T,Mk, 
the following holds: if v G NV(t) and t ~ r ■ (ki © s © k2) with the tree s rooted 




fc - (fci +k2) 



£ 
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at V, and if A: F — > NV(r) is an interpretation (where y is a set containing the 
free variables of ip and not containing x), then 



If </j = leftj(y) with j > ki, we let x] = \eftj+e{y), where £ = k — {ki + k2)- 
li Lp = right^(?;) with j > ki + 1, we let x] = rightj,,.^ (y). If = maxij(?/) 
with i > ki + 1, we let ip[^ x] = maxij+£_i(?/). And if = ma:x.i^ki+i{y), we 
let ip[^ x] ~ 5ucci{y,x). 

For all other atomic formulas, we let ip[^ x] = ip. It is elementary to verify 
that these choices guarantee the expected equivalence. Similarly, if (p ^ ipiV ip2 
(resp. ip = -^ipi), we let (^[^z^ x] = (^i[;2^ V (/32[/ a^] (resp. x] = -'<y5i[?^ a;]). 

Let us now assume that tp = QkU ■ (</5<5)i5eA where K C AMfcj^+i+fe^ is in 
/C, 2/ ^ y U {a;} and the tp^ are deterministic with respect to y. Let £" be a 
ranked alphabet disjoint from A, with a single rank n element e„ for each n 
such that An 0; and let A' = A U -E. Let L = isT • (ki © EMe © k2); then 
L € Cind{JC) by assumption. For each i5 G A, we let 7/15 = ^(y > x) A V'ai?^ x\\ 
and we let ipe = iv ^ x) for each e S We note that the ij.'s have their 
free variables in K U {x,y}. Using the induction hypothesis, one verifies that 
{tps)seA' is deterministic with respect to y, and we let = QLy'{'4'5)sGA'- Then 
7/; is a Lind(>Cmc?(/C))-formula, and by Theorem 12. 151 there exists an equivalent 
Lind(A^)-formula -0'. 

By the induction hypothesis, for every w in NV(7') and 6 £ A, (t, [A; x > 
v,y ^ w\) \= ips if and only if (r, [X,y 1— > w]) |= (/j^. Also, (i, [A;x 1— > u,y n- 
^]) H (2/ > a;) for all w S NV(s). Thus, the characteristic tree determined by t, 
[X;x 1-^ v] and the 4's is of the form f • (ki © s © k2), where f is the characteristic 
tree determined by r, A and the (ps, and where each w G NV(s) is labeled in E. 
Thus, letting ip[:;i x] = ip' , we have the desired equivalence. 

Let us now consider Property R2. Let i > 1, ki, k2,i > 0, let k = ki + i + k2 
and let iphe a rank i Lind(/C)-formula over E without any occurrence of x. We 
show by structural induction on ip that there exists a rank k Lind(/C)-formula 
ip[> xi] where a; is a free variable and such that, for any tree t £ SM^, the 
following holds: if u G NV(t), then 



where s is the subtree of t rooted at the i-th successor of v and X:Y ^ NV(s). 

If V? \ehj{y) (resp. rightj(y), max,jj(y)), we let p[> xi] = \ehki+j{y) (resp. 
i'ightfe^+j(y), maxh,ki+j{y))- If V = root(y), we let (p[> xi] = SucQ(a;,y). For all 
other atomic formulas, we let ip[> xi] — (p. If ip ^ ipi W ip2 (resp. ip ~ -^ipi), we 



•^To justify this choice of xpg, we need to verify that y = x is expressible: it is equivalent to 
Vz A"_j^ Succi (x, z) <-> Succi {y, z) ASucc^ (z, x) Succi(z, y) where n denotes the maximal rank 
of a letter in E. The presence of a universal quantifier is acceptable since we have assumed 
that K, contains /Cg. 



(t, [X;x^v])^^[;ix] 



(r,A) h¥'- 




V has rank at least i, 

t factors as t = r • (ki © s © k2) 

(s. A) h V, 
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take ip[> xi] = ipi[> xi] V <P2[> xi] (resp. ip[> xi] = -^ipi[> xi]). Again, it is 
elementary to verify that these choices guarantee the expected equivalence. 

Let us now assume that f — QkU ■ {'•Ps)s£A where K C AiW^ is in /C, 
y iJ {x} and the tps are deterministic with respect to y. Let E and A' be 
as in the first part of the proof, and let L = EMk^+i+k^ ■ (ki ® K @ k2); then 
L S Cind{IC) by assumption. 

For each n > such that S„ ^ 0, let Xn be the formula SucCi(2:, z) /\{z < y) 
(independent of n), and let x = Qxhi^)^' iXn)- By assumption, x is a Lind(/C)- 
formula. Moreover, [t, [x t-^ v;y w]) satisfies x if and only if v has rank at 
least i and w is a descendant of the «-th child of v. 

For each e € E, let = ^Xi and for each i5 G A, let = xi] A x- 

By induction, the ips [5 £ A') are Lind(A^)-formulas with free variables in 
Y U {x,y}. Using the induction hypothesis again, one verifies that {'ips)sGA' 
is deterministic with respect to y, and we let -0 = Qhy ■ {4's)5eA'- Then ip 
is a Lind(£m(i(/C))-formula, and by Theorem 12.151 there exists an equivalent 
Lind(A^)-formula -0'. 

It follows as above that (t, [A; a; f-> f]) \= ip' if and only if the rank of v is 
at least i and t factors as i = r • (ki © s © with (s. A) ^ ip, where s is the 
subtree of t with root v. □ 

This can be applied to the classes /Cg and /Cg.mod discussed in Example 12. 131 

Corollary 2.19 The logics Lind(/C3) and Lind(/Cg.mod) admit relativization. 

Proof. Let A be a ranked Boolean alphabet, let i? be a disjoint ranked alphabet 
as in the statement of Proposition 12.181 and let A' = A U i?. Denote by the 
element of rank n in E, if it exists. Let ki,k2,£ > 0, and k = ki + £ + k2- 

Let K = Kk (3) and L — K ■ (ki © EMi © k2). Define, for each m > 

(x<2/)A-Pe„ (y), 

QKk{^)y ■ {Xn)n, 

^{x<y)AP,^iy), 

leftfcj (x) A righti,^+(,_|_i(a;) A -^x A -iw and 

QKk{B)X ■ {llJn)n- 

Then a tree t e A'Afj. satisfies (^9 if and only if a letter of the form 1„ occurs 
at least once in t; [t, [x > v]) satisfies x (resp. to) if some descendant (resp. 
non-descendant) of v has its label in A (resp. in E)] and t satisfies ip if and only 
if t can be factored as i = r • (ki © s ffi k2) with all the nodes in NV(s) labeled 
in E and all the nodes in NV(r) labeled in A. It is immediate that L is defined 
by the Lind(/Cg)-formula ip Aip. 



such that ^ 

Xm — 

X = 

UJ = 

i'm = 
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Now let K = Ki{3) and L = EMk, +i+k2 ' (ki ® K (5^2). Define, for each 
m > such that S,„ ^ 

Xm = -'(2^ < y) A -'P£„(y), 

X = (3Kfc(3)y • (Xn)n, 

= {x < y) A Pe^iy), 

^ = <3A'fc(3)y • (t^rOn, 

V-m = leftfej(a-) A righti,^+(,_^i(a;) A -.X A -.w and 

Then if t S A'Mk, wc have (t, [x ^ v]) \^ uj (resp. x) if some proper descendant 
(resp. non proper-descendant) of v has its label in E (resp. in A); and t satisfies 
ip if and only if t can be factored as t = r • (ki © s ® with all the nodes in 
NV(r) labeled in E and all the nodes in NV(s) labeled in E. It is immediate 
that L is defined by the Lind(/Cg)-formula ip Aip. 

The proof that Lind(/C3_niod) admits relativization is similar. □ 

3 Algebraic characterization of logically defined 
tree languages 

3.1 The block product of preclones 

In this section, we introduce our main algebraic tool, the block product of 
preclones and of pg-pairs. This is a generalization of an operation on monoids 
that was introduced by Rhodes and Tilson [5S] , as a two-sided generalization of 
the more classical wreath product. 

Let us first (attempt to) briefly summarize the spirit of the block prod- 
uct of monoids, which was introduced |28| in relation with the description 
of bimachines (Eilenberg [TT]). Let T be a finite monoid and let t: A* 
T be a morphism. The associated bimachine T represents the simultane- 
ous operations of left-to-right and right-to-left processing of a string in A* 
by r: if ai • • • a„ G A*, the i-th component of this processing is the triple 
(r(ai • • • fli^i), Ui, T(ai_|_i • • • a„)) £ T x A x T, and the output of T is the prod- 
uct of these components, namely the following string in {T x A x T)*: 

(1, ai,T{a2 ■ ■ ■ a„))(T(ai), a2,T{a3 ■ ■ ■ a„)) ■ ■ ■ (T(ai ■ • • a„_i),a„, 1). 

The idea of the block product is to capture the (cascade) product of this bima- 
chine with an ordinary automaton, that is, to use the output of the bimachine 
as input for another automaton S operating on alphabet T x A x T. This 
translates to a monoid morphism a: {T x A x T)* — > S into a finite monoid S 
(the transition monoid of iS) - which is entirely determined by the images of 
the triples {t, a,t') £ T x A x T. For each a G A, let us denote by fa the map 
fait, t') ~ a{t, a, t'). Then the composed machine output, on input ai ■ ■ ■ a„ is 

/ai(l,T(a2 ■ • ■a„))/a2(T(ai),T(a3 • ■ -an)) ■ ■ ■ /a„(T(ai • --an-i), 1). 
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Note that fa{t, t') is the cr- image (the S'- value) of the effect of letter a in bima- 
chine T, when a is in a left-right context whose T- values are t and t' . The map 
fa itself records the effect of letter a in all possible contexts. 

The general definition of the block product of monoids is an abstraction 
of these ideas: 5* □ T is the set of pairs (/, t) G S'^^'^ x T and the product 
■ ■ ■ ifn,tn) is equal to (5,ti ■ • -tn), with 

g{t, t') — t2 - ■ ■ tn)f2{tl,t3 ■ ■ ■ t„) ■ ■ ■ /„_l(il • • ■ tn-2, tn)fn{tl ' ' ' ^n-l, !)■ 

This operation on monoids proved to be useful to decompose morphisms [281 
[50] and to explain the connection between first-order logic and aperiodic 
monoids (see [35]). We now extend these ideas to preclones. The resulting 
definition is more complex as our contexts are not just left-right pairs (see the 
definition of contexts in Section II. 3p and we need to take into account the rank 
of elements. In particular, this leads to the definition of a sequence of block 
products SUkT (fc > 0). 

Formally, let S and T be preclones. We define preclones S Dfc T for each 
fc > 0. Recall fSection ll.Sp that, for each fc, n > 0, Ik.n denotes the set of n-ary 
contexts in Tk- The set of rank n elements oi SU^T is defined to be 

{SUkT)n = si'-" xTn, n>0. 

The identity 1 is the pair (i^i, 1), where Fi(C) = 1, for all C <E Ik,i- As for 
the composition operation, let (F, /) e {S Dfc r)„, and let {d, gi) G [S Uk T)mi 
for each i G [n]. Let g = gi (B ■ ■ ■ (B g-n G Tn,m, where m = Y^^=i "^j- Then we 
let 

(F, /) • ((Gi, 5i) © ■ • ■ ® (G„, = (Q, / • g), 

an element of (S Dfc T),„ = S*™ x r,„, where Q:Ik,m Sm is described as 
follows. 

For each (u, ki, v, G /fc.m, we have v = vi® ■ ■ ■ ©w,„ G Tm,i, where t = 
k — (ki +k2)- Let vi be the ©-sum of the first mi Wj's, Vi — Vi® ■ ■ ■ ® let 
V2 be the ©-sum of the next m2 Vj^s, etc, until u„ — Um-m„+i © ■ ■ ■ © I'm is the 
©-sum of the last m„ Vj's, see Figure [71 In particular, v = ®"^i ui. For each 
i G [n], let ii be the total rank of v^, so that Vi G Trm/i and = 

For each i G [n] , we observe that gi ■ Vi € Tg. , and we let 

C; = It • (|ki © / • (51 • wi © • • • © .gi_i • Wi.i © 1 © gi+i ■ Vi+i © • • • © g„ • © . 

We note that w-(ki©/-p--(;©k2) = Cj • (pi ©ffi -Wi ©P2), where pi = ki+Yl,j<i^3 
and p2 = X]j>i + ''■a- Then Ci is an element of T with rank pi -I- 1 -I- P2 = 
fci -I- ^2 -I- ^ — + 1 = k — + (Of course, the integers pi and p2 depend on i 
even though our notation does not show it.) 

In particular, Ci = (ci,pi, u,;,p2) is a context in Ik,mi, see Figure [71 We are 
finally ready to define Q: 

Q{u,ki,v,k2) = F{u,ki,g-v,k2) ■ (Gi(Gi) © • ■ • © G„(G„)). 



28 




Vn 



Figure 7: Two views of (u, ki,k2,v) € Ik.m, and the context C„ 

Lemma 3.1 The above definition satisfies the axioms of preclones. 

Proof. Let us first verify the axioms concerning the identity element. Let 
{G,g) e (S-DfcT)™ and let (Q,<?) = (i^i,l) • (G,^). Let (w, fci, fe) G 4,,„. 
With reference to the notation in the definition above, we have n ~ I and 
Ci = {u,ki,v,k2)- It follows that Q = G, so (i^i,l) • {G,g) = {G,g). 

Let now [F, f) e (^□fcT)„ and let (Q, /) = (F, /)-((Fi, !)©• • MFi, !))■ Let 
(u,ki,v,k2) e Ik,n- Theng(M,fci,'y,A:2) = fci, w, A:2)-(i^i (Gi)®- • •©i^i(G„)) 
for some Gi, . . . , G„, and hence fci, fe) = fci, w, A;2) • (1 ® • • • © 1) = 
F{u, ki,v, fe). Thus (F, /) • ((Fi, 1) © • • • © (Fi, 1)) = (F, /). 

Next let (F,/) e {S Dfc T)„; for i G [n] let (Gi,gi) S {S T)m, and let 
= EjgM "^^J i ^ H: let {Hj,hj) G (5 Dfe T)p^. and let p = J2je[m] Pi- 

Let g = fflig[„]5i and let h = ffije[m]^j- We also denote by hi the ©-sum of 
the first mi /ij 's, the ©-sum of the next m2 /ij 's, etc, to the ©-sum of the 
last m„ /ij's, so that h = ©ig[„]/ii. The rank of hi is Zl^'mi+^"+m,_i+i Pj- 

We need to consider p-ary contexts in T^: let {u,ki,v,k2) G Ik,p be such a 
context. Then f = f i © • • • ffi Up G Tp^^ with £ = k ~ ki — k2. Let -Di denote 
the ©-sum of the first pi f^'s, 1)2 the ©-sum of the next p2 w^'s, etc to Vm the 
©-sum of the last p,„ I'i's. For i G [n], we also denote by Vi the ©-sum of the Vj 
where hj is part of the summation defining hi. That is, t;i = wi © ■ • • © ,■ ■ ■ , 

Vn = Vm-m„+l © ' ' ' © Wm • 

We first consider the product 

((F, /) ■ ((Gi,gi) © ■ • ■ © (G„,5„))) • ((i?i, /ii) © • • • © (-ff™, 
= (g,/-.g)-((i?i,/ii)©---©(-ff™,/i™)) 
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= (RJ-g-h). 

Then we have R{u, fci, ^2) = Q{u,ki,h-v,k2)- © • • • ©i7,„(-B,„)), with 
Bj ~ {bj,p[,Vj,p'2) where bj {j G [m]) is 

u ■ (ki © / • g ■ (/ii ■ ui © ■ • ■ © hj-i ■ Vj-i © 1 © hj+i ■ Vj+i © ■ • ■ © /i™ • -Dm) © k2), 
p[ = ki + Y^izl rank(ws) and ^2 = ^2 + S^j+i rank(ws), so that 
u ■ (ki © / • 5 • /i • V © ka) = 6j • (p'l © /ij ■ w^- © p'2). 




Figure 8: The contexts C„ and Z?; 



Moreover, Q{u, ki, h ■ v, ^2) = F{u, kyg-h-v, k2) • (G'i(Ci) © • • • © G„(C„)), 
where Cj (i S [n]) is the context (q, ri, /i^ • Wi, r2) with q equal to 

M-(ki©/-(gi-/irl5i©- ■ •©gi_i-/ij_rWi_i©l©g^+i-/ii+r!)i+i©- • ■©5„-/i„-w„)©k2) , 
T"! = fci + Es=i rank(l;s), r2 = fc2 + J21i=i+i r3rtk{vs), so that 

u ■ {ki® f ■ g ■ h ■ V ®k2) = Ci ■ {ri ® gi - hi -Vi® r2). 
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See Figure [i Thus 

n rn 

R{u, k,,v,k2)^ F{u, fci, g . . z;, ■ ( G, (Q)) • ( H, (B^ )) . 

i=i j=i 

We compare this result with the product 

(F, /) • (((Gi,gi) © ■ • ■ © (G„,5„)) • ((i/i, /li) © • ■ • © {Hm,hrn))) 

= (F,/). ((Q'i,5i./ii)© •••©(Q;,5„ 

Then we have R'{u, fci,D, ^2) = F{u,ki,g- h ■ fe) ■ (QU-Di) © • • • © (5J,(I?n)), 
where Di {i € [n]) is the context (q, ri, lii, r2), where Ci, ri, r2 are defined above. 
Next we compute Q[{Di): we have 

Q[{Di) = Q[{ci,ri,di,r2) = G(ci, n, /ii • ^i, ra) • (Bi(Fi) © ■ • ■ © iJ^i (F„, J) 

where Ej {j e [toi]) is the context {ej,r[ j,Vj,r2 j), 

ej = u • (ki © / ■ (gi • (/ii • ui © • ■ • © ■ © 1 

©/ij + l • Wj + i © ■ • ■ © ■ Vmi ) © 52 ■ /l2 • W2 © ■ ■ ■ © 3n ■ /in • S„) © k2) 

and r[ ^ and r2 ^ are appropriate integers so that ci • (ri © 171 • ft-i • ui © r2) = 
' ® ■ © i"2j), see Figure [8] 

We observe now that ej ~ bj and Ej = _Bj for j G [toi]. So we have 
Q'iPi) = Gi(Gi). 07^,^,(5,). 
Similarly, for each i G [n] , we have 

mi 

Qi{Di) = Gi{Ci) ■ HraiJi hmi_i+j(-BmiH hmi-i+j), 

i=i 

and we have verified that R ^ R' . □ 

We also define block products of pg-pairs. If {S, A) and (T, B) are pg-pairs 
and fc > 0, we define (5, A) Dk {T, B) to be the sub-pg-pair of 5 Dfc T generated 
by those pairs (^,17) such that for some n > 0, g G Bn and F{c) G A„ for each 

c G 4,n- 

Let (5, A) and (T, B) be p.g-pairs and let a: AM S and (3: BM ^ T be 
the natural morphisms, so that a{a) ~ a and /3(6) = h for all a G A and b € B. 
Let (t/, E) = {S,A) Dk iT,B), and let ip:T,M U C S Dk T he the natural 
morphism. By definition, each <t g S„ (n > 0) is a pair = (Fc&o-) with 
60- G Bn and F^ G An'"' ■ Let tt: EM ^ BM be the morphism induced by the 
second component projection from E to B, and let r = /? o tt: SAf ^ T, see 
Figure m We now describe a way of computing ip{t) for a tree t G EM„, say 
^{t) ^ iQt,Tit)). 
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EM ► BM 




U 



Figure 9: The morphisms tp, tt and P 

Let D = {u,ki,v,k2) G Ik.n- We define the tree tu by relabehng the nodes 
of t in NV(t) with elements of A as follows. Let a; be a node of t of rank m > 0, 
let g be the subtree of t whose root is x, and let hi,. . . ,h„i be the subtrees 
whose roots are the children of x, see Figure [TUl Let ft, = /ii © • • • let 
a e Sm be the label of x in t and let r2 > be such that g ~ a ■ h E SAf^a and 
h e SAf„.,. 



cr 




Figure 10: The trees t and g = a ■ h, and the context C 

Let us write t = / • (ri ® g © fa), where ri and ra arc integers such that the 
node X is now labeled by a variable in / (that is, \eftrj^{x) and right„_^^_|_]^(a;) 
in i, and n = ri + r2 + J'a). Let ui be the ©-sum of the first ri Wj's, W2 
the ©-sum of the next r2 Vj's and V3 the ©-sum of the last Wj's. Then 
we have vi S Tr^^p^, V2 G Tra^pa ^-iid ^3 G 1^3,^3 for some pi,P2,P3 > (and 
/c = fci +pi +P2 +P3 + k2). Let then c w • (ki © r(/) • (wi © 1 © W3) © k2), so 
that C = {c,ki + pi,T{h) • V2,P3 + ^2) £ /fc,™, see Figure [TOl We finally label 
the node x in tjj by Fo.(C). 
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The resulting tree to is an element of AM„. We now show the following 
fact. 

Fact 3.2 With the notation above, ip(t) = {Qt,T{t)) where Qt{D) = a(f£)) for 
each context D G Ik,n- 

Proof. The proof is by structural induction on t. If t = 1, then Id = 1 for 
each D, and if{t) = (_Fi, 1), so the announced result holds. 

If t consists of a single node, then i = cr e Sq and ip{t) = {Fcy,bcr)- Now 
let D = {u,ki,0,k2) G Ik.o- With the notation above, we have g — a ~ t, 
h ^ 0, f ~ 1, and pi = rj = 0. In particular, c = u. It follows that C = D 
and to = F„{D). Moreover, since Fc{D) £ A, we have = Fa{D). This 

concludes the verification of the equality for one-node trees. 

Let us now assume that t G T,Mn {n > 0) has more than one node, let 
a G E,n be the label of the root of t, and let s^^\ . . . , s*^™^ be the subtrees of 
t attached to the children of the root. Let also s — s^^^ © ■ • ■ © s''"-', so that 
t = a ■ s. Let D = (u, ki, w, k2) € Ik.n- By induction, we have 

QtiD) = F^{u, ki,T{s) ■ w, fcs) • (a(4'') © ' ' ' ® > 
where u; = -iDi © ■ • ■ © Wn, Ci = (c^, gi, Wi, ^2), 

c, = u • (ki © T{a ■ © • • • © s^*-^' © 1 © © • • • © s^"))) © k2) 

and qi and 92 are appropriate integers (which depend on i) such that 
■u ■ (ki © T{t) • -u; © k2) = Cj • (qi © r(s'')) • w, © q2). 

We compare this value with If a is the label of the root of and if 

di,...,d,„ are the subtrees of i^i attached to the children of the root, then 
Oiito) = a{a) ■ a{di). We first discuss the value of a. With reference to the 
notation in the definition of the labels of to above, since t ~ a ■ s, the integers 
pi,P3, ri, ra are all ec^ual to and vi = V3 = 0. In particular, a ~ Fa-{u, fci, r(s) • 
It;, fc2). Thus a{a) ~ a{F^{u, fci, r(s) • u), ^2)) = Fc{u, fci, r(s) • ui, k2). 

To conclude, we need only to verify that di = s[^^ for each i G [m], that is, 
each node x of t in NV(s(*^), has the same label in di and in But it is easy 
to see that the label of x in both tu and s^q. is of the form Fp(C) where p G S 
is the label of x in s^'^ and C is appropriate. □ 

3.2 Closed pseudovarieties 

We say that a pseudovariety V of preclones is closed if every block product SUkT 
with 5, T G V and fc > belongs to V. Closed pseudovarieties of pg-pairs are 
defined similarly. Since the intersection of a family of closed pseudovarieties is 
closed, there exists a least closed pseudovariety containing any given class K of 
finitary preclones (resp. finitary pg-pairs) . 
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We now give a technical result on closed pseudovarieties, that will be used 
in the proof of our main result. We consider the situation where S,T,T' are 
preclones and T is a sub-prcclonc of T'. Then the elements of S Dk T' whose 
second component belongs to T , form a sub-prcclonc of S'Dfc T' which we denote 

Proposition 3.3 Let Y be a closed pseudovariety of preclones. Let S,T 
and let T' be a finitary preclone such that T is a sub-preclone of T' . For each 
k > 0, the product S T belongs to V. 

Before wc prove Proposition l3.31 we verify a technical lemma. In this lemma, 
we use the notation in the proposition. In particular, we need to consider 
contexts in both Tj, and T^.. We denote by /fc_„ (resp. /[, „) the set of n-ary 
contexts in Tk (resp. T^). 

Lemma 3.4 Let S, T , T' and k be as in Provosition \3.S\ and let C G /(. There 

exists a morphism oP : S T SUnT such that, if {F, /) G {S T)n, then 
a^{F,f) is of the form a'^ {F J) {F^ J) with F'^ {1,0, n,0) ^ F{C). 

Proof. Let C = {u, ki,v, ^2) G „ and let {F, f) G (S'nf m > 0. We first 
define a mapping F'~^:In,m Sm- Let D = {r,pi,s,p2) G In,m- By definition, 
P1+P2 < n, and we let vi be the ©-sum of the first pi Vi^s, V2 be the ©-sum of 
the last p2 Vi^s, and v be the ©-sum of the middle n — pi -~p2 Vi's. In particular, 
there exist integers qi,q,q2 such that vi G Tp^^q^, v G T„-pi-p2,q, V2 G 7p2,g2 
and ki + pi + qi + q + q2 + P2 + k2 = k, see Figure [TT] We let 

F'^{D) = F{u- (ki ©r • {vi © 1 © wa) © k2), /ci -I- s • w, + ^2) • 

The verification that F'-' [1, 0, n, 0) — F{C) is straightforward, and we need to 
show that a'~^ : [F, f ) ^ [F'-^ , f) defines a morphism of preclones. 

Let {F,f) G {S Ul' T)„ and let [G„g,) G (5 nf T)h, e H)- For 
convenience, we let g be the ©-sum of the gi, so g G T„^ ^ with h — hi. Let 
(Q,/ • 9) = {F,f) ■ e,(G„g,), so that {{F, f) • ®^{G^.9^)) = (Q^,/ • q)- 
Moreover, let {R, f ■ g) = {F'^ , f) ■ 0,(Gf , gi). We need to verify that = R. 

Let D = (r,pi, s,p2) G Ih.n- For each i G [m], let Si be the appropriate 
©-sum of Sj 's such that g ■ s ~ gi ■ si (B ■ ■ ■ (B Qm ■ Sm- Then R{D) ^F^{r,pi,g- 
s,P2) -04 Gf(A), where A = {ri,^l,s^,^2), is 

J' • (Pl © / ■ (31 ■ Sl ffi • ■ ■ © 9i-l ■ Si-l © 1 © gi+l ■ Si+l © • ■ • © 5m • Sm) © P2), 

and £1, ^2 are such that r • (pi (B f ■ g ■ s (B P2) = ri ■ {li (B gi ■ Si(B (2)- 
Thus R{D) is the composition of 

F{u ■ {ki ® r ■ {vi B I <B V2) ®'k2), ki + Pi, g ■ s ■ v,P2 + ^2) 

with 

m 

(u • (ki © r, • (w^ © 1 © wJ,) © k2), fci -(- £1, g • s • w\ 4 4- fc2) , 

4=1 
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qi / m \ 92 



Figure 11: F'~^{D) is the image by F of the context represented here 



where i; = ® © ^2 is the appropriate grouping. Observe that 
is equal to 

u-(ki©pi©/-(5i-si©---©5,_i-s,_i©l©.g,+i-s,+i©---©g„-s™)©P2©k2). 

Now, Q^{D) = Q(u-(ki©r-(?;i©l©W2)©k2),fci+pi,s-v,p2 + fc2). By 
definition of the block product, this is equal to the composition of 

F{u • (ki ©r • [vi © 1 © W2) ©k2),A;i + ■ s • v,p2 + ^2) 



with 



i=l 



where v = 0^ v'^ is the appropriate grouping, 

r[ = It • (ki © r • (wi © / • (51 • si • wi © • • • © • Si-i ■ © 1 

©5i+l • Si+i • Ui+i © • • • © 5m • Sm • © U2) © k2) 

where v = wi © • • ■ © w,„ and zi and Z2 are appropriate integers. 

It is now a straightforward verification that R{D) = Q'-"{D), which concludes 
the proof. □ 
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Proof of Proposition 13.31 By Proposition 11.51 it suffices to verify that dis- 
tinct elements of equal rank in S T can be separated by a morphism into an 
element of V. 

So we consider (Fi, /i) and (F2, /2), rank n elements of 5* T. If /i ^ /2, 
the second component projection is a morphism into T G V which separates 
the two elements. If /i ~ /2, then Fi ^ F2 and we let C £ I^. ^ such that 
Fi(C) 7^ F2{C). Then the morphism a*^ in Lemma 13.41 is a morphism into 
S* □„ T G V which separates the two elements. □ 

If {S,A) and {T',B') are pg-pairs and (T, B) is a sub-pg-pair of {T',B'), 
recall that the block product {S, A) (T', B') is generated by a ranked alphabet 

E' such that = A^t"" X B;„. We let (5, A) D^^'"^'^ (T, B) be the sub-pg-pair 

of {S, A) Dk (T', B') generated by the subset E of E' such that E„i = A,™ x 
for each m. 

Proposition 3.5 Let \ be a closed pseudovariety of pg -pairs. Let {S^A) and 
{T,B) be pg-pairs in V and let {T',B') be a finitary pg-pair such that (T,B) 

is a sub-pg-pair of {T',B'). For each k > 0, the product {S,A) □[.^ {T,B) 
belongs to V. 

Proof. We note that the morphism in the proof of Lemma 13. 4[ maps each 

generator of {S,A) □i^''^''' {T,B) to a generator of {S,A) □„ (T,B), so a'~^ is 
also a morphism of pg-pairs between these block products. 

The same scheme as in the proof of Proposition 13.31 can then be applied, 
using Proposition 11.61 instead of Proposition 1 1.51 □ 

We conclude with a result on full pseudovarieties (see Section II. 4p . 

Proposition 3.6 Let W be a pseudovariety of preclones, let V = pgp(W) and 
let W and V be the closure of W and V respectively. Then V and V are full 
and V = pgp(W). 

Proof. In view of Proposition fL^ it suffices to show that V = pgp(W). We first 
verify that if {S, A) E V, then 5* G W, by induction on the construction of (S, A) 
from elements of V by means of block products. If {S, A) G V, then G W 
by definition and hence, S G W. Now suppose that {S,A) < (S^^^A^^)) Dk 
(S'(2),^(2)) and S'(^),S'(2) G W. By definition of the block product of p.g-pairs, 
5* < S'(i) Ok S-^^) and hence S &W. _ 

Now we show that if {S, A) is a finitary pg-pair with 5 G W, then {S, A) G V. 
The proof is by induction on the construction of S from elements of W by means 
of block products. If 5 G W, then {S, A) G pgp(W) = V by definition. Now 
suppose that S S^^^ Dfe S''^' and pgp(S'('') C V. Let B*^) be the projection of 
A onto Each element of A is of the form (F, b) with b G B^'^K Let B^^) be 
the imion of the ranges of the first components of elements of A. Then, if T^*) 
is the sub-preclone of S'^'^ generated by B^'\ we have {S,A) C (T^^), A^^)) Dk 
(T(2), yl(2)). It follows that {S, A) G V. □ 
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3.3 Characterizing CindiJC) 

Our main result is: 

Theorem 3.7 Let K. he a class of recognizable tree languages such that each 
quotient of a language in K- belongs to Cind{lC) and such that Lind(/C) admits 
relativization. Then a language is in Cind{JC) if and only if its syntactic pg- 
pair belongs to the least closed pseudovariety of pg-pairs containing the syntactic 
pg -pairs of the languages in K.. 

The proof of Theorem 13.71 is based on Propositions 13.81 and 13.91 below. Ap- 
plications are considered in the next section. 

Proposition 3.8 Let K, be a class of recognizable tree languages such that 
Lind(/C) admits relativization and let {S,A) and (T,B) be pg-pairs such that 
every language recognizable by {S,A) or {T,B) belongs to Cind{IC). Then every 
language recognizable by a block product of (S, A) and (T, B) also belongs to 
Cind(lC). 

Proof. Let A: > 0, (C/, E) = {S,A) Uk {T,B), and tp: ii:M,'£) (C/, E) be the 
morphism induced by the identity map of E. Let L be a tree language recognized 
by a morphism ip': (E'Af , E') (C/, E). Since p is onto and (E'M, E') is free, 
there exists a morphism tp: (E'A/, E') (EA/, E) such that if' = p o ijj. In 
particular, L = (^'"^((^'(L)) = ip-'^ {ip-'^ [if' [L))) . In view of Theorem [2171 it 
suffices to show that every language recognized by lies in Cind[lC). This in 
turn reduces to showing that ip~^{F,g) e £ind{IC) for each {F,g) g U. 

We use the information obtained in Fact 13.21 on the computation of ip{t). 
As in that statement, we let a: AM S and [3: BM ^ T be the natural 
morphisms, we let tt: EA/ BM be the morphism induced by the second 
coordinate projection from E to B. and we let r = /3 o tt: EA/ — > T. For each 
cr e E, we let ip{a) = {Fa,bcr)- 

Let {F,g) e Un- For each t S EAf„, we have pit) — {Qt,T{t)), where 
Qt G "S'n"'* is described in Fact 13.21 We note that T~^{g) is recognized by 
{T,B), and hence is in Cind{IC). We denote by Xg a rank n Lind(/C)-sentence 
defining T~^{g). 

Recall (from Fact [3?2|) that Qt{D) = for ah D G 4,„. For each s € S", 

the tree language Kg = a~^{s) is recognized by {S, A), and hence Kg G Cind{IC). 
It now suffices to show that, for each s G Sn and D G /fc,„, there exists a rank 
n Lind(/C)-scntence ips.D defining the language 

{t G EAf„ ItoeKs}. 

Indeed, since I^.n is finite, it will follow that ip^^{F, g) is defined by the conjunc- 
tion of Xg and the "03, d {F) G /fc,™ and F{D) = s). We construct the sentence 
tjjs,D in the form ips.D = Qk,z ■ {ilia)a^A (where is a rank n Lind(AI!)-formula 
on E depending on a and D). (The formula i!s,D is actually a Lind(£mc?(/C))- 
formula but this is sufficient for our purpose in view of Theorem 12.151 (3).) 
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Let a € Am and D = (it, fci, w, ^2) G Ik.n- For each 0<i<j<n+l, 
let vi = ®l^iVq, V2 = ©^Ij+i^g and V3 = ®g^jVq; and let pi, p2, Pa be the 
ranks of vi, V2 and V3 respectively. For each such for m > 0, u G E^, 
+P3+fc2i ci © • ■ • © Cm G T„i.p2, we let 

V'' = Pa(^;)Aleft,(z)Aright^-(z)AXc[y^2]A /\ XcJ> a;^] 

and we let 4'a{z) be the (finite) disjunction of the '^''{z) when 

F„{u • (ki ©c • (wi © 1 ©U3) ©k2),fci (ci © • ■ • ©Cm) • U2,P3 + ^2) = fl- 
it is elementary to verify that (t, z 1— s- x) ^ V'a if and only if node a; is labeled a 
in tD- Thus t satisfies '4's,d if and only iito S Kg, if and only if Q{t) — aijo) G 
a(i^s) = {s}, which concludes the proof. □ 

Proposition 3.9 Let Y he a set of first order variables and let y ^ Y . Let 

{ips)s£A be a family of rank k hind-formulas over S, with free variables in 
Y U {y}, deterministic with respect to y, let K C AA/fe be a tree language and 
let LP = Qxy ■ {^5)s€A- 

Let (S*, A) be a pg-pair recognizing K , and let (T, B) be a pg-pair recognizing 
simultaneously the languages L^p^ C TiY\j{y]Mk (i5 G AJ. Then the language L^ 
(a subset ofEyMk) is recognized by {S,A) Dk (T,B). 

Proof. Let k: (AM, A) — > {S, A) be an onto morphism recognizing K, and let 
r: (EyujyjM, S-jAu^y}) —^ {T,B) be an onto morphism recognizing each of the 
L^s G A). 

We observe the following: if t € SyujyjMfc is a (F U {y})-structure and y 
occurs in the label of a rank n node, then there exists a unique S G A„ such that 
T{t) G t{Lip^). Indeed, the determinism of {(ps)5 with respect to y shows that t 
lies in exactly one of the L^^ {S G A„): by hypothesis, r~^(T(iy_)) = L^^ for 
each e, so rit) G t{L^J and e G A„ imphcs e = S. 

Wc consider the block product of pg-pairs {S, A) Dk (T, B). For each cr G I]„ 
and Z C y (so that (cr, Z) G (Sy)„), we let -fia,Z) = {F^^z,T{a, Z)), where 
Fc^z is defined as follows. Let {u,ki,v,k2) G Ik,n- If w- (ki©T(CT, ZU{y})-wffik2) 
is the T-image of some (Y U {y})-structure, then we let F^^ziu, ki,v, k2) = n{5) 
where 5 G A„ is uniquely determined by the property t{u ■ (ki © (cr, Z U {?/}) • v © 
1^2)) G t{Lip^). Otherwise, we choose fcr,z(u, fci, f , ^2) arbitrarily in An. Note 
that 7(cr, Z) lies in the generator set of the pg-pair (5*, A) Uk {T, B). 

Let now t G EyMfc and let D = (1, 0, k, 0) G Ik,k- Then -/(t) = (Q, r(t)), and 
Q{D) = a{tD), where a: AM — > S* is the natural morphism and to is described 
in Fact 13.21 In particular, let x G N V(t) be a rank n node, labeled by (cr, Z) in 
t and let t factor as t = / ■ (ri ffi (a, Z) • /i ffi ra) where /, ri and r^ are such 
that (cr, Z) ■ h is the subtree of t rooted at node x. The label of x in is equal 
to Fc^z{T{f), ri, T{h), r2), for the computation of which we need to consider the 
tree t(/ ■ (ri © (cr, Z U {y}) ■ h © ra)), that is, r(i'), where t' is equal to t with 
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the label of x changed to (cr, Z U {y}). Note that t' is a (F U {y})-structure, so 
X is labeled by k{6) {5 e A„) in to if and only if r(<') g L^^. 

Going back to the definition of the interpretation of Lindstrom quantifiers, 
this shows that t G Lip if and only if to € K. As a result, L^p ~ j^^{P) where 
P consists of the pairs {F, f) such that i^(l,0, k, 0) £ ce{K), and hence is 
recognized by {S, A) Ok (T, B). □ 



We are now ready to complete the proof of Theorem 13.71 

Proof of Theorem 13.71 Let K be the class of syntactic pg-pairs of the ele- 
ments of K., let V be the pseudovariety of pg-pairs generated by K, and let V 
be the least closed pseudovariety containing V. We first show that if L is a 
tree language with syntactic p^-pair {S, A) e V, then L G Cind{IC). In view of 
Proposition II. 7i {S,A) can be obtained from elements of K by a succession of 
operations consisting of taking cither a sub-pg-pair, a quotient, a direct product 
or a block product. We let ^{S,A) be the least number of such operations, and 
we proceed by induction on ^{S,A). 

If 'tis, A) = 0, then {S,A) e K, that is, (S.A) is the range of the syntactic 
morphism ip: (EM, E) — * (S, A) of a language K C EM^ in IC. We want to show 
that every language recognized by {S, A) is in Cind{IC) . As in the first lines of 
the proof of Proposition 13. 8[ this reduces to showing that for each s G S, we 
have Lp^^{s) G Cind{IC). Now we deduce from Remark ll.lOl tliat 

^"'(«) = f]i{u,h,k2)-'K)v-'\\J{{u,k,,k2)-'K)v~\ 

where the intersection runs over all n-ary contexts (u, fci, w, ^2) over EMfc such 
that (^{u,ki,k2)~^K^v~^ meets ip~^{s), and the union over the n-ary contexts 
that do not. Moreover, by Remark 1 1 . 1 01 again, this union and this intersection 
are finite. It follows from Theorem 12.171 that (p~^{s), and hence any language 
recognized by {S,A) lies in Cind{JC). 

We now suppose that '^{S, A) > 0. If {S, A) is a sub-pg-pair or a quotient of 
ap.g-pair {T,B) e V with tl(T,5) < ]^{S,A), Proposition [Ql establishes that L 
is also recognized by {T,B), so every such language is in Cind{IC) by induction 
hypothesis. If {S, A) is the direct product of pg-pairs (T, B) and (T', B') with 
lesser jj-values, then by a standard argument, every language recognized by 
{S, A) is a finite union of intersections of the form LCiL' , where L is recognized 
by {T,B) and L' by {T',B'). In particular, such a language is in Cind{lC) by 
Theorem 12. 171 If on the other hand, {S,A) divides a block product of pg-pairs 
with lesser jj- values, the inductive step follows directly from Proposition 13. 81 

This concludes the proof that every tree language recognized by a p^-pair in 
V is in Cind{lC). We now turn to the converse, namely showing that any tree 
language defined by a Lind(AI!)-sentcncc has its syntactic pg-pair in V. 

This is implied by the following, more precise statement: if iy9 is a rank k 
Lind(AI!)-formula with free variables in a finite set Y , then is recognized 
by a morphism a: (SyM, Ey) — > {S^A) such that Im.qj{a) S V, where Xm0(a) 
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denotes the sub-pg-pair of {S, A) generated by a^E) (recall that S is identified 
with the subset E x {0} of Sy). 

We prove this statement by structural induction on ip. If (p is an atomic 
formula and a is the syntactic morphism of L^, then Xiaijj^a) is trivial by 
Example I2.14[ and hence lies in V. If ip =: tpi \/ ip2, then by induction hy- 
pothesis, there exist morphisms 0;^: (EyAf, Sy) — > (5*^, Ai) recognizing L^p. with 
Im0(ai) G V, i = 1,2. It is immediate that Ly, is recognizable by the target 
tupling a = (ai, q;2): (EyM, Ey) ((51,^1) x (52,^2)), and that Im0(a) is 
a sub-pij-pair of the direct product Tm0(ai) x Xm0(a2)- Since V is a pseu- 
dovariety, it follows that Tm(^{a) is in V. The case where ip is of the form 
(p = -1(^1, is also easily treated: any morphism recognizing L^p-^ also recognizes 
its complement, namely L^. 

Finally, suppose that tp is of the form ip = QkU • {^s)s<£Ai where K C AMk 
is recognized by some {S,A) G K. Without loss of generahty, we may assume 
that y ^ Y. By induction, each L^p^ {S G A) is recognized by a morphism 
Ps such that Xin(i,{Ps) G V. Taking the target tupling of the (3s, we construct 
a morphism /3: (Eyufy}^^) Syu{y}) ^ {T',B') recognizing each L^^ and such 
that Tm0(/3) G V (since Xm0(/3) is a sub-p(/-pair of the direct product of the 
Xm0(/35)). By Proposition 13.91 (and its proof), we find that is recognized 
by a morphism 7:(EyA/, Ey) {S^A) Uk {T',B'), where the composition 
TT o 7: (EyM, Ey) {T',B') agrees with /? (here tt is the second component 
projection). In particular, Zm0(7r o 7) is contained in Zm0(/3). Thus Zm0(7) 
is a sub-pg-pair of iS,A) D^"^ ^ Im0(/3), and hence Xm0(7) G V by Proposi- 
tion [331 □ 



3.4 Applications 

The first result follows directly from Theorem 13.71 and Proposition 13.61 

Theorem 3.10 Let K, he a class of recognizable tree languages such that each 
quotient of a language of JC is in Cind(IC) and such that Lind(/C) admits rela- 
tivization. LefV be the least pseudovariety of pg -pairs containing the syntactic 
pg-pairs of elements of K, and let~V be the least closed pseudovariety containing 
V. The following holds. 

• Cind{IC) is a literal variety of recognizable tree languages, associated with 
the pseudovariety V in the Eilenberg correspondence (Theorem \l.ll\) . 

• If K, is the class of languages recognized by a class L of preclones, then 
Cind{lC) is a variety of recognizable tree languages. Moreover, if W is 
the pseudovariety of preclones generated by L and W is the least closed 
pseudovariety of preclones containing W, then W is the pseudovariety of 
preclones associated with Cind{1C). 
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Proof. Theoreni l3 . 7l shows that £md(/C) consists of recognizable languages and 
Theorem 12 . 1 71 then shows that it is a literal variety. Let X be the pseudovariety 
of pg-pairs associated with Cind{lC) by Theorem 11.111 Then X and V contain 
the same syntactic pg-pairs by Theorem 13. 7[ and Proposition 11.81 shows that 
this implies X = V. This concludes the proof of the first statement. 

We now suppose that /C is the class of tree languages recognized by the 
preclones in a class L. By definition, V = (pgp(L)) and Proposition 11.91 shows 
that V is full and 

V = pgp((precl(V))) = pgp((L)) = pgp(W). 

Proposition 13.61 then shows that V is full and V = pgp(W). Thus Cind{lC) is a 
variety of tree languages (Corollarv ll.l2p and the corresponding pseudovariety 
of preclones is precl(V) = W. □ 

The following statement is an important consequence of Theorem l3.10[ which 
motivated this work. 

Corollary 3.11 Cind{lC^) (that is, the class of FO- definable tree languages) is 
a variety of tree languages and the corresponding variety of preclones is the least 
pseudovariety containing T3 and closed under block product. 

Proof. Note that, by Example 12. 131 Cind{IC^) is the class of FO-definable tree 
languages. Recall that /Cg consists of the language Kk{J) C AMk, where A is 
a ranked Boolean alphabet. 

Now let V3 be the variety of tree languages corresponding to the pseudova- 
riety (Tg) generated by Tg. According to Example 11.131 a language L C SA/j. 
is in Vg if and only if it is a Boolean combination of languages of the form 
E'A/fc, S' C E. Now the complement of S'A/fc in EAf^ is the language of trees 
that contain at least a letter outside E': therefore this complement is the in- 
verse image of Kk{3) in the literal morphism from EM to AM that maps E' 
to {0„ I n > 0} and E \ S' to {In | n > 0}. In view of the closure properties 
of Cind{K.^) (Theorem 12 .171) . it follows that /Cg C Vg C Cind{IC^) and hence 
£md(Vg) = Cind{JCj) by Theorem Eli 

By definition, Vg is a variety and as such, it is closed under taking quotients. 
The corresponding logic Lind(Vg) is equivalent to Lind(/Cg) (since Cind{V^) = 
£mc?(/Cg)) and hence it admits rclativization by Corollarv l2.191 Thus we can 
apply Theorem 13. 101 to conclude the proof. □ 

A similar reasoning, using both Tg and the preclones Tp (see Section [TT2] and 
Examples [L4 l \2J0\ [2T3l and Corollary EHH) , yields the fohowing result. 

Corollary 3.12 The class of{FO+ MOD) -definable tree languages is a variety 
of tree languages and the corresponding variety of preclones is the least pseu- 
dovariety containing Tg and the Tp (p>2) and closed under block product. 
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Conclusion 



Wc reduced the characterization of the expressive power of certain naturally de- 
fined logics on trees, a chief example of which is given by first-order sentences, 
to an algebraic problem. This algebraic problem is set in a new algebraic frame- 
work, that of preclones, which the authors introduced in |18] precisely for the 
purpose of discussing tree languages. It is worth stating again that the notion 
of algebraic recognizability resulting from this new framework coincides with 
the usual one: we simply gave ourselves a richer algebraic set-up to classify 
recognizable tree languages. 

Our result does not yield (yet?) a decidability result for, say, first-order 
definable tree languages, but we can now look for a solution of this problem 
based on the methods of algebra. In this process, it will probably be necessary 
to develop the structure theory of preclones, to get more precise results on the 
block product operation. 

A positive aspect of our approach is its generality: it is not restricted to the 
characterization of logics based on the use of Lindstrom quantifiers, nor indeed 
to the characterization of logics. Our key algebraic tool is the block product: this 
product was introduced by Rhodes and Tilson [28| for monoids, to investigate 
the lattice of pseudovarieties of monoids and its application to the theory of 
formal languages (of finite words), and we adapted its definition for preclones. 
The use of wreath products instead of block products (the wreath product can 
be seen as a one-sided restriction of the block product) can yield algebraic 
characterizations for other natural classes of recognizable tree languages, see 

Our approach also raises a number of questions. At a technical level first: 
it was shown in [28j that for monoids, the block product can be expressed in 
terms of a double semidirect product, a two-sided generalization of the semidi- 
rect product. It might be convenient to have such a notion for preclones as well, 
and to derive analogues of the wreath product principle and the block product 
principle (general descriptions of the languages recognized by a wreath product 
or a block product). This might yield, as in the finite word case, the charac- 
terization of the recognizing power of the block product of two varieties, the 
characterization of logical hierarchies within FO, etc. 

At a more general level, we observe that in the word language case, the 
decidability of first-order definability does not stem from the analogue of our 
main result, namely the fact that a language is FO-definable if and only if its 
syntactic monoid is in the least pseudovariety containing {0, 1} and closed un- 
der block product. It follows rather from the characterization of that class of 
monoids as the aperiodic monoids, see the theorems of McNaughton and Papert 
on the equivalence of FO-definability and star-freeness, and of Schiitzenberger 
on the equivalence between star-freeness and aperiodicity. This characterization 
makes use in an essential way of the notion of star-freeness and of the struc- 
ture theory of finite monoids. The question is therefore whether we can find 
a useful analogue of star-freeness for tree languages. There were attempts in 
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this direction ( [121 [2Zl ) that estabhshed that the more natural notions of 
star-freeness for trees do not coincide with FO-definabihty. Are we missing on 
an important concept? Taking the question from a different angle, can we di- 
rectly develop the relevant fragment of a structure theory of finitary preclones, 
to prove decidability of FO-definability? 

Another, more general remark is the following. We are convinced that the 
algebraic concept of preclones is well suited for the study and the classification 
of recognizable languages of finite ranked trees. However, we are conscious that 
beyond its qualities (the first of which is to allow results such as those proved in 
this paper), our algebraic framework is cumbersome, and perhaps intimidating. 
We argued in [TS] that several known results on the characterization of particular 
classes of tree languages can be expressed in a natural way in the language of 
preclones, — but there might be an equivalent, yet lighter algebraic set-up. 

This remark is related with another question. Other algebraic frameworks 
have been investigated in the literature since our results were announced in 
2003 [T7] , in the (considerable) interval it took for this paper to be written and 
refereed. One of the more promising and elegant is the notion of forest algebras, 
introduced by Bojahczyk and Walukiewicz [8|, initially to discuss languages of 
unranked, unordered trees. These authors also achieved the algebraic charac- 
terization of certain logically defined tree languages. More recent papers record 
interesting results using forest algebras, also to discuss ranked or ordered trees 
(e.g. Bojahczyk, Segoufin, Straubing, Walukiewicz dHHISI), and it is tempt- 
ing to wonder whether this algebraic approach and ours could be unified, since 
forest algebras may be seen as an unsorted version of preclones. 

Possibly as a longer term project, one should consider the following. From 
the point of view of applications (in the field of verification, the investigation 
of distributed computation models, of game theory, etc), being able to handle 
languages of infinite (ranked ordered) trees is important. Discussing languages 
of infinite words as well as languages of finite words, was a topic of interest 
from the very beginnings of automata theory (Biichi), automata models were 
proposed quite early on, but the development of an algebraic model to handle 
them (namely the notion of w-semigroups) was very slow in coming, and was 
matured only in the late f980s (through work of Arnold, Perrin, Pin, Wilke, 
etc, see [21]). Can an analogous extension be developed for our preclones? One 
key technical tool in dealing with recognizable languages of infinite words is 
Ramsey's theorem, and the authors are unfortunately not aware of a relevant 
analogue of this theorem for trees. 
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